Anti-bacterial methods and materials

ABSTRACT

Staphylococcus aureus  virulence genes are identified, thereby allowing the identification of novel anti-bacterial agents that target these virulence genes and their products, and provision of novel  S. aureus  mutants useful in vaccines.

This application is a continuation-in-part of U.S. application Ser. No.08/887,534 filed Jul. 3, 1997.

The invention relates generally to the identification of genesresponsible for the virulence of Staphylococcus bacteria, therebyallowing the identification of new anti-bacterial agents that targetthese virulence genes and their products and the provision of novel S.aureus mutants useful in vaccines.

BACKGROUND OF THE INVENTION

The staphylococci, of which Staphylococcus aureus is the most importanthuman pathogen, are hardy, gram-positive bacteria that colonize the skinof most humans. Staphylococcal strains that produce coagulase aredesignated S. aureus; other clinically important coagulase-negativestaphylococci are S. epidermidis and S. saprophyticus. When the skin ormucous membrane barriers are disrupted, staphylococci can causelocalized and superficial infections that are commonly harmless andself-limiting. However, when staphylococci invade the lymphatics and theblood, potentially serious complications may result, such as bacteremia,septic shock, and serious metastatic infections, including endocarditis,arthritis, osteomyelitis, pneumonia and abscesses in virtually anyorgan. Certain strains of S. aureus produce toxins that cause skinrashes, food poisoning, or multisystem dysfunction (as in toxic shocksyndrome). S. aureus and S. epidermidis together have become the mostcommon cause of nosocomial non-urinary tract infection in U.S.hospitals. They are the most frequently isolated pathogens in bothprimary and secondary bacteremias and in cutaneous and surgical woundinfections. See generally Harrison's Principles of Internal Medicine,13th ed., Isselbacher et al., eds., McGraw-Hill, New York (1994),particularly pages 611-617.

Transient colonization of the nose by S. aureus is seen in 70 to 90percent of people, of which 20 to 30 percent carry the bacteria forrelatively prolonged periods of time. Independent colonization of theperineal area occurs in 5 to 20 percent of people. Higher carriage ratesof S. aureus have been documented in persons with atopic dermatitis,hospital employees, hospitalized patients, patients whose care requiresfrequent puncture of the skin, and intravenous drug abusers.

Infection by staphylococci usually results from a combination ofbacterial virulence factors and a diminution in host defenses. Importantmicrobial factors include the ability of the staphylococcus to surviveunder harsh conditions, its cell wall constituents, the production ofenzymes and toxins that promote tissue invasion, its capacity to persistintracellularly in certain phagocytes, and its potential to acquireresistance to antimicrobials. Important host factors include an intactmucocutaneous barrier, an adequate number of functional neutrophils, andremoval of foreign bodies or dead tissue.

Cell wall components of S. aureus include a large peptidoglycan complexthat confers rigidity on the organism and enables it to survive underunfavorable osmotic conditions, a unique teichoic acid linked topeptidoglycan, and protein A, which is found both attached topeptidoglycan over the outermost parts of the cell and released insoluble form. Proteins designated femA and femB are involved in theformation of cell wall peptidoglycan pentaglycine cross-bridges and arefactors in methicillin resistance. [Berger-Bachi et al., Mol. Gen.Genet., 219:263-269 (1989).] S. aureus also has specific receptors forlaminin and fibronectin that may mediate the organism's spread throughthe bloodstream to other tissues. Both peptidoglycan and teichoic acidare capable of activating the complement cascade via the alternativepathway. S. aureus also appears to activate tissue factor in thecoagulation pathway.

Certain enzymes produced by S. aureus may play a role in virulence.Catalase degrades hydrogen peroxide and may protect the organism duringphagocytosis. Coagulase is present in both soluble and cell-bound formsand causes plasma to clot by formation of thrombin-like material. Thehigh correlation between coagulase production and virulence suggeststhat this substance is important in the pathogenesis of staphylococcalinfections, but its precise role as a determinant of pathogenicity hasnot been determined. Many strains also produce hyaluronidase, an enzymethat degrades hyaluronic acid in the connective tissue matrix and thatmay promote spreading of infection. A trypsin-like protease from somestrains enhances influenza virus infection by proteolytic cleavage ofthe viral precursor hemagglutinin into its active fragments and maycontribute to the morbidity of such coinfections.

S. aureus produces numerous extracellular exotoxins that have beenimplicated in disease processes. The exfoliatin toxins A and B, thestaphylococcal enterotoxins, and the toxic shock syndrome toxin, TSST-1,belong to the growing family of microbial superantigens that activate Tcells and monocytes/macrophages, resulting in the production ofcytokines that mediate local or systemic effects depending on the amountof toxin formed, the immune status of the host, and the access of thetoxin to the circulation. The exfoliatin toxins mediate the dermatologicmanifestations of the staphylococcal scalded-skin syndrome and bullousimpetigo. These toxins cause intraepidermal cleavage of the skin at thestratum granulosum, leading to bullae formation and denudation. Sevendistinct enterotoxins (A, B, C1, C2, C3, D, and E) have been implicatedin food poisoning due to S. aureus. These toxins enhance intestinalperistalsis and appear to induce vomiting by a direct effect on thecentral nervous system. Toxic shock syndrome (TSS) is most conunonlymediated by TSST-1, which is present in 5 to 25 percent of clinicalisolates of S. aureus. TSS is also mediated less frequently byenterotoxin B and, rarely, enterotoxin C1.

S. aureus produces other toxins whose role in virulence is incompletelyunderstood. Four different red blood cell hemolysins, which aredesignated alpha, beta, gamma, and delta toxins, have been identified.Alpha toxin also causes necrosis of the skin when injectedsubcutaneously into animals, while delta toxin also inhibits waterabsorption in the intestines and may play a role in the acute waterydiarrhea seen in some cases of staphylococcal infection. Leukocidinlyses granulocyte and macrophage membranes by producing membrane porespermeable to cations.

The agr, xpr, sae and sar genes have been identified as being involvedin the regulation of staphylococcal exotoxins. See U.S. Pat. No.5,587,228 and International Patent Publication Nos. WO 96/10579 and WO97/11690. Of interest is the report in WO 97/11690 of screening forinhibitors of these regulatory systems.

Staphylococci can invade the skin or mucosa through plugged hairfollicles and sebaceous glands or areas traumatized by burns, wounds,abrasions, insect bites, or dermatitis. Staphylococci often colonizeprosthetic devices and intravenous catheters; S. aureus infection of thevascular access site is a major cause of morbidity and death amongpatients on hemodialysis. Colonization and invasion of the lungs mayoccur with endotracheal intubation, or when the lungs' clearancemechanisms are depressed, e.g., after viral infections, afteraspiration, or in patients with cystic fibrosis. Mucosal damage to thegastrointestinal tract following cytotoxic chemotherapy or radiotherapypredisposes to invasion from that site.

Once the skin or mucosa have been breached, local bacterialmultiplication is accompanied by inflammation, neutrophil accumulation,tissue necrosis, thrombosis and fibrin deposition at the site ofinfection. Later, fibroblasts create a relatively avascular wall aboutthe area. When host mechanisms fail to contain the cutaneous orsubmucosal infection, staphylococci may enter the lymphatics and thebloodstream. Common sites of metastatic spread include the lungs,kidneys, cardiac valves, myocardium, liver, spleen, bones and brain.

Bacteremia due to S. aureus may arise from any local infection, ateither extravascular (cutaneous infections, burns, cellulitis,osteomyelitis, arthritis) or intravascular foci (intravenous catheters,dialysis access sites, intravenous drug abuse). Rarely, patients withbacteremia die within 12 to 24 hours with high fever, tachycardia,cyanosis, and vascular collapse. Disseminated intravascular coagulationmay produce a disease mimicking meningococcemia. Commonly, the diseaseprogresses more slowly, with hectic fever and metastatic abscessformation.

A major complication of S. aureus bacteremia is endocarditis. S. aureusis the second most common cause of endocarditis and the most commoncause among drug addicts. The disease is typically acute, with highfever, progressive anemia, and frequent embolic and extracardiac septiccomplications. Valve ring and myocardial abscesses are common. Themortality rate is 20 to 30 percent.

Staphylococcal scalded-skin syndrome (SSSS) is a generalized exfoliativedermatitis that is a complication of infection by exfoliatintoxin-producing stranis of S. aureus. The disease typically occurs innewborns (Ritter's disease) and in children under the age of five. Ascarlatiniform rash begins in the perioral area, becomes generalizedover the trunk and extremities, and finally desquamates. The disease mayconsist of rash alone (staphylococcal scarlet fever), or large, flaccidbullae develop that may be localized (more common in adults) orgeneralized. The bullae burst, resulting in red, denuded skin resemblinga burn. Most adults with SSSS are immunosuppressed or have renalinsufficiency. Blood cultures are frequently positive, and mortality issignificant.

Toxic shock syndrome (TSS) is a multisystem disease mediated by toxins(generally TSST-1, and less frequently enterotoxins B and C1) producedby certain strains of S. aureus. It was first described in children, butin 1980 became epidemic among young women, with onset duringmenstruation. The diagnosis of TSS is based on clinical criteria thatinclude high fever, a diffuse rash that desquamates on the palms andsoles over the subsequent one or two weeks, hypotension that may beorthostatic, and evidence of involvement in three or more organ systems.Such involvement commonly includes gastrointestinal dysfunction(vomiting or diarrhea), renal or hepatic insufficiency, mucous membranehyperemia, thrombocytopenia, myalgias with elevated creatinephosphokinase (CK) levels, and disorientation with a normalcerebrospinal fluid examination. The mortality rate of TSS is threepercent.

S. aureus causes approximately three percent of community-acquiredbacterial pneumonias. This disease occurs sporadically except duringinfluenza outbreaks, when staphylococcal pneumonia is relatively morecommon, although still less frequent than pneumococcal pneumonia.Primary staphylococcal pneumonia in infants and children frequentlypresents with high fever and cough. Multiple thin-walled abscesses areseen on the chest X-ray, and empyema formation is common. In olderchildren and healthy adults, staphylococcal pneumonia is generallypreceded by an influenza-like respiratory infection. Onset ofstaphylococcal involvement is abrupt, with chills, high fever,progressive dyspnea, cyanosis, cough, pleural pain, and sometimes bloodysputum. Staphylococcal pneumonia is seen more frequently in patientswith cystic fibrosis, in intubated patients in intensive care units andin debilitated patients who are prone to aspiration.

S. aureus is responsible for the majority of cases of acuteosteomyelitis. Although the disease is most common in people under theage of 20, it is becoming increasingly prevalent in adults over 50,particularly with involvement of the spine. A primary portal of entry isfrequently not identified, although many patients give a history ofpreceding trauma to the involved area. Once established, infectionspreads through the bone to the periosteum or along the marrow cavity.Rarely, the joint capsule is penetrated, producing pyogenic arthritis.Osteomyelitis in children may present as an acute process beginningabruptly with chills, high fever, nausea, vomiting, and progressive painat the site of bony involvement.

S. aureus causes 1 to 9 percent of cases of bacterial meningitis and 10to 15 percent of brain abscesses. Most commonly, the bacteria are spreadfrom a focus outside the central nervous system, typically frominfective endocarditis, by extension from a paraspinal or parameningealabscess, or by nosocomial infection following neurosurgical procedures.Over 50 percent of epidural abscesses are due to S. aureus; up to halfof these cases may be associated with vertebral osteomyelitis. Patientspresent with either acute or chronic back pain, usually with low-gradefever and malaise. The onset of radicular pain is an ominous sign thatthe disease may progress to neurologic dysfunction and ultimateparalysis.

Antimicrobial resistance by staphylococci favors their persistence inthe hospital environment. Over 90 percent of both hospital and communitystrains of S. aureus causing infection are resistant to penicillin. Thisresistance is due to the production of β-lactamases enzymes; the genesfor these enzymes are usually carried by plasmids. Infections due toorganisms with such acquired resistance can sometimes be treated withpenicillinase-resistant β-lactam antimicrobial agents. However, the truepenicillinase-resistant S. aureus organisms, calledmethicillin-resistant S. aureus (MRSA), are resistant to all theβ-lactam antimicrobials as well as the cephalosporins. MRSA resistanceis chromosomally mediated and involves production of an alteredpenicillin-binding protein (PBP 2a or PBP 2′) with a low bindingaffinity for β-lactams. MRSA frequently also have acquired plasmidsmediating resistance to erythromycin, tetracycline, chloramphenicol,clindamycin, and aminoglycosides. MRSA have become increasingly commonworldwide, particularly in tertiary-care referral hospitals. In theUnited States, approximately 5 percent of hospital isolates of S. aureusare methicillin-resistant.

Thus, there continues to exist a need for new agents useful for treatingbacterial infections, particularly those caused by antibioic-resistantbacteria, and for methods of identifying such new agents. Such methodsideally would identify agents that are unrelated to existingantimicrobials and that target different aspects of staphylococcalinvasion of and replication in the host, compared to existingantimicrobials.

SUMMARY OF THE INVENTION

The present invention relates generally to the identification of genesresponsible for the virulence of Staphylococcus bacteria, therebyallowing the identification of new anti-bacterial agents that targetthese virulence genes and their products and the provision of novel S.aureus mutants useful in vaccines.

According to one aspect of the present invention, methods are providedfor identifying anti-bacterial agents that target the function ofstaphylococcal virulence genes or gene products. Such methods includeassaying potential agents for the ability to interfere with expressionof virulence gene products represented by the DNA sequences set forth inany one of SEQ ID NOS: 1 through 94, or assaying potential agents forthe ability to interfere with the function of a bacterial proteinencoded in whole or in part by a DNA sequence set forth in any one ofSEQ ID NOS: 1 through 94 or the complementary strand thereof, followedby identifying agents that are positive in such assays.

The use of a number of different assays is contemplated according tothis aspect of the invention. When the function of the virulence geneproduct is known or predicted by sequence similarity to a known geneproduct, potential inhibitors can be screened in enzymatic or othertypes of assays keyed to the function of the gene product. When thevirulence gene product is known or predicted by sequence similarity to aknown gene product to interact with another protein or nucleic acid,inhibitors of this interaction can be screened directly in bindingassays or using the two-hybrid assay. Other assays may be used when aligand for the virulence gene product is not known, including two-hybridscreening assays that identify gene products that interact with targetprotein, assays that identify ligands of target protein throughmeasuring of direct binding of test ligand to target protein, and assaysthat identify ligands of target proteins through affinityultrafiltration with ion spray mass spectroscopy/HPLC methods or otherphysical and analytical methods.

In another aspect of this invention, methods are provided for assayingpotential agents for the ability to interfere with expression of orfunction of virulence gene products, wherein the virulence genesencoding these products are obtainable by identification throughsignature-tagged mutagenesis as defined herein and exemplified inExample 1.

According to a further aspect of this invention, novel anti-bacterialagents identified by the methods described herein are provided, as wellas methods for treating a subject suffering from infection withstaphylococci involving administration of such novel anti-bacterialagents. In particular, agents that interfere with the expression ofvirulence gene products include anti-sense polynucleotides that arecomplementary to the virulence gene sequences. Agents that interferewith the function of virulence gene products include variants ofvirulence gene products, ligands of these virulence gene products andvariants thereof, and enzyme inhibitors (where the product is anenzyme).

Yet a further aspect of this invention provides Staphylococcus aureusorganisms containing a functional mutation in a gene represented by anyone of SEQ ID NOS: 1 through 94, said functional mutation resulting in areduction in virulence of the organism. Also contemplated are vaccinecompositions comprising such mutated S. aureus organisms, optionallycomprising a suitable adjuvant and a pharmaceutically acceptable diluentor carrier.

Numerous additional aspects and advantages of the invention will becomeapparent to those skilled in the art upon consideration of the followingdetailed description of the invention which describes presently preparedembodiments thereof.

DETAILED DESCRIPTION OF THE INVENTION

“Virulence genes,” as used herein, are genes whose function or productsare required for successful establishment and/or maintenance ofbacterial infection in a host animal. Thus, virulence genes and/or theproteins encoded thereby are involved in pathogenesis in the hostorganism, but may not be necessary for growth in vitro. Sinceantibiotics are typically screened in vitro, identification of these invivo virulence genes provides a means for discovering new antimicrobialswith different targets and mechanisms of action compared to existingantibiotics. There may be 50 to 100 virulence genes in S. aureus [seeGroisman and Ochman, Trends Microbiol. Sci., 2:289-294 (1984)(discussing Salmonella virulence genes); Muhldorfer and Hacker, Microb.Pathogenesis, 16:171-181 (1994) (discussing E. coli virulence genes).

“Signature-tagged mutagenesis,” as used herein, is a method generallydescribed in International Patent Publication No. WO 96/17951,incorporated herein by reference, and includes, for example, a methodfor identifying S. aureus genes required for virulence in a murine modelof bacteremia. In this method, each insertional mutation carries adifferent DNA signature tag which allows mutants to be differentiatedfrom each other. The tags comprise 40-bp variable central regionsflanked by invariant “arms” of 20-bp which allow the central portions tobe co-amplified by polymerase chain reaction (PCR). Tagged mutantstrains are assembled in microtitre dishes, then combined to form the“inoculum pool” for infection studies. At an appropriate time afterinoculation, bacteria are isolated from the animal and pooled to formthe “recovered pool.” The tags in the recovered pool and the tags in theinoculum pool are separately amplified, labelled, and then used to probefilters arrayed with the different tags representing the mutants in theinoculum. Mutants with attenuated virulence are those with tags thatgive hybridization signals when probed with tags from the inoculum poolbut not when probed with tags from the recovered pool.

Signature-tagged mutagenesis allows a large number of insertional mutantstrains to be screened simultaneously in a single animal for loss ofvirulence. Screening thirteen pools of 96 mutant S. aureus strainsresulted in the identification of 50 strains with reduced virulence,many of which were confirmed to be attenuated in virulence by subsequentanalysis of individual mutants. The nucleotide sequences of the regionsflanking the transposon insertion points of these mutants were analyzedby searching DNA and protein sequence databases to identify the genesinactivated by the insertion of the transposon.

On the basis of these searches many of the virulence genes may begrouped into different classes. The first class encodes proteinsinvolved in cell surface metabolism (e.g., P2C73, P11C29, P13C83, P9B65,P10B89). Both femA and femB, which are involved in the formation of cellwall peptidoglycan pentaglycine cross bridges, were identified asvirulence genes. Mutant P2C73 contains a transposon insertion in apreviously unknown gene that shares significant similarity to femB.Mutant P14C15 contains a transposon insertion in a gene whose product issimilar to aspartate semialdehyde dehydrogenase (Asd) from differentbacteria, with the highest level of similarity to Asd from Bacillussubtilis. Asd is a key enzyme in the biosynthesis of methionine,threonine, isoleucine, lysine and diaminopimelic acid (DAP), which is animportant component of cell wall peptidoglycan.

The second class encodes enzymes involved in cellular biosyntheticpathways (e.g., P9B74, P5C4, P9B66, P14C15, P13B26, P7C18, P15C31,P10B18, P6B18, P10B66, P10C34, P12C3). Deduced protein products of twogenes (mutants P7C18 and P13B26) show strong similarity to B. subtilisLysA and ThrB. These enzyzmles, like Asd, are involved in aspartatemetabolism. LysA is diaminopimelate decarboxylase, which convertsdiaminopimelate to lysine, and ThrB phosphorylates homoserine beforeconversion into threonine. Transposon insertions were also obtained ingenes homologous to Methanococcus jannaschii trpA, Lactococcus lactistrpB and L. lactis trpD. These genes encode enzymes of the tryptophanbiosynthetic pathway: the alpha chain of tryptophan synthetase, the betachain of tryptophan synthetase, and anthranilatephosphoribosyltransferase, respectively. The gene mutated in P15C31 is ahomolog of L. lactis purL encoding phosphoribosylformylglycinamidinedecarboxylase, an enzyme of the purine biosynthetic pathway. MutantP9B66 contains an insertion in a gene whose product is similar topeptide methionine sulphoxide reductases.

A third class of genes are those encoding components of the TCA cycle(e.g., mutants P4C27, P4C52, P10B2, P10C20, P12C32). Strains P10B2 andP12C32 carry mutations in genes for a subunit of the oxoglutaratedehydrogenase complex and aconitase, respectively.

The fourth class is composed of genes whose products are similar to aoligopeptide transport proteins of the ATP-binding cassette (ABC)transporter superfamily (e.g., mutants P7C26, P10C15, P5C3, P11C66,P5C34). Oligopeptide transport is important for peptide utilization andthe proteolytic system in lactococci. In Group A streptococci, Oppproteins are involved not only in obtaining nutrients but also inadherence, protease production and processing of secreted proteins.

The fifth class of genes are involved in cellular regulatory and repairprocesses (e.g., mutants P4C15, P13B74, P13C72, P10B30, P6C63, P14B25).Mutant P4C15 and P6C63 contain insertions in S. aureus MarR/LuxR-likeregulatory proteins. MarR and LuxR belong to a family of transcriptionregulators and these MarR/LuxR-like proteins likely have a similarfunction in S. aureus. In Streptococcus pneumoniae, Neisseriagonorrhoeae and Escherichia coli this enzyme helps to maintain surfaceadhesins in their functional oxidative state. Mutant P10B30 isassociated with a transposon insertion in a gene with a product similarto the ATP-dependent Clp protease of E. coli. The Clp stress responsesystem for intracellular protein degradation is widely conserved inbacteria and components of the system are important for virulence ofListeria monocytogenes and S. typhimurium. Mutants P13B74 and P13C72have stem-loop termination sequences which possibly function intranscription termination.

The identification of these genes as virulence genes renders them usefulin methods of identifying new anti-bacterial agents according to thepresent invention. Such methods include assaying potential agents forthe ability to interfere with expression of virulence gene productsrepresented by the DNA sequences set forth in any one of SEQ ID NOS: 1through 94 (i.e., the genes represented by DNA sequences of SEQ ID NOS:1 through 94 encode the virulence gene product, or the DNA sequences ofSEQ ID NOS: 1 through 94 are adjacent to the gene encoding the virulencegene product, or are involved in regulation of expression of thevirulence gene product), or assaying potential agents for the ability tointerfere with the function of a bacterial protein encoded in whole orin part by a DNA sequence set forth in any one of SEQ ID NOS: 1 through94 or the complementary strand thereof, followed by identifying agentsthat are positive in such assays. Polynucleotides and polypeptidesuseful in these assays include not only the genes and encodedpolypeptides as disclosed herein, but also variants thereof that havesubstantially the same activity as the wild-type genes and polypeptides.“Variants,” as used herein, includes polynucieotides or polypeptideswhich contain one or more deletions insertions or substitutions, as longas the variant retains substantially the same activity of the wild-typepolynucleotide or polypeptide. With regard to polypeptides, deletionvariants are contemplated to include fragments lacking portions of thepolypeptide not essential for biological activity, and insertionvariants are contemplated to include fusion polypeptides in which thewild-type polypeptide or fragment thereof have been fused to anotherpolypeptide.

The virulence genes may be cloned by PCR, using S. aureus genomic DNA asthe template. For ease of inserting the gene into expression vectors,PCR primers are chosen so that the PCR-amplified gene has a restrictionenzyme site at the 5′ end preceding the initiation codon ATG, and arestriction enzyme site at the 3′ end after the termination codon TAG,TGA or TAA. If desirable, the codons in the gene are changed, withoutchanging the amino acids, according to E. coli codon preferencedescribed by Grosjean and Fiers, Gene, 18:199-209 (1982), and Konigsbergand Godson, Proc. Natl. Acad. Sci. (USA), 80:687-691 (1983).Optimization of codon usage may lead to an increase in the expression ofthe gene product when produced in E. coli. If the gene product is to beproduced extracellularly, either in the periplasm of E. coli or otherbacteria, or into the cell culture medium, the gene is cloned withoutits initiation codon and placed into an expression vector behind asignal sequence. For example, cloning and expression of the femA gene isdescribed in Example 3 below.

To simplify the protein purification process, a purification tag may beadded either at the 5′ or 3′ end of the gene coding sequence. Commonlyused purification tags include a stretch of six histidine residues (U.S.Pat. Nos. 5,284,933 and 5,310,663), a streptavidin-affinity tagdescribed by Schmidt and Skerra, Protein Engineering, 6:109-122 (1993),a FLAG peptide [Hopp et al., Biotechnology, 6:1205-1210 (1988)],glutathione S-transferase [Smith and Johnson, Gene, 67:31-40 (1988)],and thioredoxin [LaVallie et al., Bio/Technology, 11:187-193 (1993)]. Toremove these peptide or polypeptides, a proteolytic cleavage recognitionsite may be inserted at the fusion junction. Commonly used proteases arefactor Xa, thrombin, and enterokinase.

Proteins are produced in any number of well-known prokaryotic oreukaryotic expression systems using known promoters, vectors, and hosts.Any suitable host cell may be used for expression of the gene product,such as E. coli, other bacteria, including Bacillus and S. aureus,yeast, including Pichia pastoris and Saccharomyces cerevisiae, insectcells, or mammalian cells, including CHO cells, utilizing suitablevectors known in the art. Proteins may be produced directly or fused toa peptide or polypeptide, and either intracellularly or extracellularlyby secretion into the periplasmic space of a bacterial cell or into thecell culture medium. Secretion of a protein requires a signal peptide(also known as pre-sequence); a number of signal sequences fromprokaryotes and eukaryotes are known to function for the secretion ofrecombinant proteins. During the protein secretion process, the signalpeptide is removed by signal peptidase to yield the mature protein.

The virulence gene products produced by the methods described above areused in high throughput assays to screen for inhibitory agents. Thesources for potential agents to be screened are chemical compoundlibraries, fermentation media of Streptomycetes, other bacteria andfungi, and cell extracts of plants and other vegetations. For proteinswith known enzymatic activity, assays are established based on theactivity, and a large number of potential agents are screened forability to inhibit the activity. For proteins that interact with anotherprotein or nucleic acid, binding assays are established to measure suchinteraction directly, and the potential agents are screened for abilityto inhibit the binding interaction.

Alternatively, such binding interactions are evaluated indirectly usingthe yeast two-hybrid system described in Fields and Song, Nature,340:245-246 (1989), and Fields and Sternglanz, Trends in Genetics,10:286-292 (1994), both of which are incorporated herein by reference.The two-hybrid system is a genetic assay for detecting interactionsbetween two proteins or polypeptides. It can be used to identifyproteins that bind to a known protein of interest, or to delineatedomains or residues critical for an interaction. Variations on thismethodology have been developed to clone genes that encode DNA-bindingproteins, to identify peptides that bind to a protein, and to screen fordrugs. The two-hybrid system exploits the ability of a pair ofinteracting proteins to bring a transcription activation domain intoclose proximity with a DNA-binding domain that binds to an upstreamactivation sequence (UAS) of a reporter gene, and is generally performedin yeast. The assay requires the construction of two hybrid genesencoding (1) a DNA-binding domain that is fused to a protein X, and (2)an activation domain fused to a protein Y. The DNA-binding domaintargets the first hybrid protein to the UAS of the reporter gene;however, because most proteins lack an activation domain, thisDNA-binding hybrid protein does not activate transcription of thereporter gene. The second hybrid protein, which contains the activationdomain, cannot by itself activate expression of the reporter genebecause it does not bind the UAS. However, when both hybrid proteins arepresent, the noncovalent interaction of protein X and protein Y tethersthe activation domain to the UAS, activating transcription of thereporter gene. When the virulence gene product (protein X, for example)is already known to interact with another protein or nucleic acid(protein Y, for example), this assay can be used to detect agents thatinterfere with the interaction of X and Y. Expression of the reportergene is monitored as different test agents are added to the system; thepresence of an inhibitory agent results in lack of a reporter signal.

When the function of the virulence gene product is unknown and noligands are known to bind the gene product, the yeast two-hybrid assaycan also be used to identify proteins that bind to the gene product. Inan assay to identify proteins that bind to protein X (the targetprotein), a large number of hybrid genes each containing a differentprotein Y are produced and screened in the assay. Typically, Y isencoded by a pool of plasmids in which total cDNA or genomic DNA isligated to the activation domain. This system is applicable to a widevariety of proteins, and it is not even necessary to know the identityor function of protein Y. The system is highly sensitive and can detectinteractions not revealed by other methods; even transient interactionsmay trigger transcription to produce a stable mRNA that can berepeatedly translated to yield the reporter protein.

Other assays may be used to search for agents that bind to the targetprotein. One such screening method to identify direct binding of testligands to a target protein is described in U.S. Pat. No. 5,585,277,incorporated herein by reference. This method relies on the principlethat proteins generally exist as a mixture of folded and unfoldedstates, and continually alternate between the two states. When a testligand binds to the folded form of a target protein (i.e., when the testligand is a ligand of the target protein), the target protein moleculebound by the ligand remains in its folded state. Thus, the folded targetprotein is present to a greater extent in the presence of a test ligandwhich binds the target protein, than in the absence of a ligand. Bindingof the ligand to the target protein can be determined by any methodwhich distinguishes between the folded and unfolded states of the targetprotein. The function of the target protein need not be known in orderfor this assay to be performed. Virtually any agent can be assessed bythis method as a test ligand, including, but not limited to, metals,polypeptides, proteins, lipids, polysaccharides, polynucleotides andsmall organic molecules. For example, use of femA in this method ofscreening for potential ligands is described in Example 3 below.

Another method for identifying ligands for a target protein is describedin Wieboldt et al., Anal. Chem., 69:1683-1691 (1997), incorporatedherein by reference. This technique screens combinatorial libraries of20-30 agents at a time in solution phase for binding to the targetprotein. Agents that bind to the target protein are separated from otherlibrary components by centrifugal ultrafiltration. The specificallyselected molecules that are retained on the filter are subsequentlyliberated from the target protein and analyzed by HPLC and pneumaticallyassisted electrospray (ion spray) ionization mass spectroscopy. Thisprocedure selects library components with the greatest affinity for thetarget protein, and is particularly useful for small molecule libraries.

The inhibitors/binders identified by the initial screens are evaluatedfor their effect on virulence in vivo mouse models of S. aureusinfections. Models of bacteremia, endocarditis, septic arthritis, softtissue abscess or pneumonia may be utilized. Inhibitors/binders thatinterfere with bacterial virulence are capable of preventing theestablishment of an infection or reversing the outcome of an infectiononce it is established.

The identification of S. aureus virulence genes also provides formicroorganisms exhibiting reduced virulence, which are useful invaccines. Such microorganisms include the S. aureus mutants generated inExample 1 below and other S. aureus mutants containing at least onefunctional mutation in a gene represented by any one of SEQ ID NOS: 1through 94. The reduced virulence of these organisms and theirimmunogenicity may be confirmed by administration to a subject. While itis possible for an avirulent microorganism of the invention to beadministered alone, one or more of such mutant microorganisms arepreferably administered in a vaccine composition containing suitableadjuvant(s) and pharmaceutically acceptable diluent(s) or carrier(s).The carrier(s) must be “acceptable” in the sense of being compatiblewith the avirulent microorganism of the invention and not deleterious tothe subject to be immunized. Typically, the carriers will be water orsaline which will be sterile and pyrogen free. The subject to beimmunized is a subject needing protection from a disease caused by avirulent form of S. aureus.

Any adjuvant known in the art may be used in the vaccine composition,including oil-based adjuvants such as Freund's Complete Adjuvant andFreund's Incomplete Adjuvant, mycolate-based adjuvants (e.g., trehalosedimycolate), bacterial lipopolysaccharide (LPS), peptidoglycans (i.e.,mureins, mucopeptides, or glycoproteins such as N-Opaca, muramyldipeptide [MDP], or MDP analogs), proteoglycans (e.g., extracted fromKlebsiella pneumoniae), streptococcal preparations (e.g., OK432),Biostim™ (e.g., 01K2), the “Iscoms” of EP 109 942, EP 180 564 and EP 231039, aluminum hydroxide, saponin, DEAE-dextran, neutral oils (such asmiglyol), vegetable oils (such as arachis oil), liposomes, Pluronic®polyols or the Ribi adjuvant system (see, for example GB-A-2 189 141).Recently, an alternative adjuvant consisting of extracts of Amnycolata,a bacterial genus in the order Actinomycetales, has been described inU.S. Pat. No. 4,877,612. Additionally, proprietary adjuvant mixtures arecommercially available. The adjuvant used will depend, in part, on therecipient organism. The amount of adjuvant to administer will depend onthe type and size of animal. Optimal dosages may be readily determinedby routine methods.

The vaccine compositions optionally may include pharmaceuticallyacceptable (i.e., sterile and non-toxic) liquid, semisolid, or soliddiluents that serve as pharmaceutical vehicles, excipients, or media.Any diluent known in the art may be used. Exemplary diluents include,but are not limited to, polyoxyethylene sorbitan monolaurate, magnesiumstearate, methyl- and propylhydroxybenzoate, talc, alginates, starches,lactose, sucrose, dextrose, sorbitol, mannitol, gum acacia, calciumphosphate, mineral oil, cocoa butter, and oil of theobroma.

The vaccine compositions can be packaged in forms convenient fordelivery. The compositions can be enclosed within a capsule, sachet,cachet, gelatin, paper or other container. These delivery forms arepreferred when compatible with entry of the immunogenic composition intothe recipient organism and, particularly, when the immunogeniccomposition is being delivered in unit dose form. The dosage units canbe packaged, e.g., in tablets, capsules, suppositories or cachets.

The vaccine compositions may be introduced into the subject to beimmunized by any conventional method including, e.g., by intravenous,intradermal, intramuscular, intramammary, intraperitoneal, orsubcutaneous injection; by oral, sublingual, nasal, anal, vaginal, ortransdermal delivery; or by surgical implantation, e.g., embedded underthe splenic capsule or in the cornea. The treatment may consist of asingle dose or a plurality of doses over a period of time.

It will be appreciated that the vaccine of the invention may be usefulin the fields of human medicine and veterinary medicine. Thus, thesubject to be immunized may be a human or an animal, for example, cows,sheep, pigs, horses, dogs and cats, and poultry such as chickens,turkeys, ducks and geese.

Other aspects and advantages of the present invention will be understoodupon consideration of the following illustrative examples. Example 1addresses identification of S. aureus genes associated with virulence bygenerating S. aureus mutants that contain a chromosomal insertion of asignature-tagged transposon and identifying the mutants with reducedvirulence. Example 2 addresses confirmation of the attenuated virulenceof individual mutants. Example 3 addresses use of the virulence genesand gene products in assays for screening potential agents foranti-bacterial activity.

FIGURE LEGENDS

FIG. 1. A. Map of svr and plasmids for complementation. Sites forrestriction enzymes are indicated (A, AluI; E, EcoRI; P, Pst I). B.Deduced amino acid sequence of the Svr protein. C. Southernhybridization of chromosomal DNA. DNA was extracted from S. aureusstrains RN6390, Newman, Wood 46, ID401 and ID402 and digested withHindIII. The digested DNA was separated by agarose gel electrophoresis,transferred onto nylon membrane and probed with ³²P-labeled svr gene.

FIG. 2. Phenotypic analysis of P6C63. A. Production of α-, β- andδ-toxins. S. aureus strains were streaked onto rabbit blood agar plate(for α-toxin), sheep blood agar plates (for β-toxin) and horse bloodagar plates (for δ-toxin) culture overnight. Halos surrounding bacterialcolonies are indicative of toxin activity. B. Western immunoblot ofprotein A. Whole cell proteins were extracted from each strain and equalamounts were separated by SDS-PAGE electrophoresis followed by Westernimmunoblotting with anti-protein A monoclonal antibody. C. Proteinsamples in (B) detected by PAGE and Coomassie Blue staining.

FIG. 3. In vivo survival kinetics of bacterial strains. Two groups ofmice were infected i.p. with 1×10⁵ cfu of S. aureus wild-type strainRN6390 or svr mutant P6C63. At 6, 24, 48, 72, 96 and 120 hourspost-inoculation, two mice/group were killed and the number of bacterialcfu/spleen was determined. Data are the mean ± standard deviations ateach time point.

FIG. 4. A. RT-PCR analysis of svr transcripts in strains RN6390, RN6911and P6C63. Total RNA isolated from 10⁹ cfu of post-exponential phasegrowth S. aureus strains. RT-PCR was performed using two primers thatamplify a 200 bp region within the svr gene. B and C. Northern blotanalysis. Total RNA was isolated from 10⁹ cfu of post-exponential phasegrowth S. aureus strains RN6390, RN6911, P6C63 and complemented strainP6C63(pID437). RNA was separated by 1.5% agarose-0.66M formaldehyde gelelectrophoresis, transferred to a nylon membrane and probed with³²P-labelled genes as indicated.

EXAMPLE 1 Identification of S. aureus Genes Associated With Virulence

S. aureus genes associated with virulence were identified bysignature-tagged mutagenesis as follows, generally according toInternational Patent Publication No. WO 96/17951 and Hensel et al.,Science, 269:400-403 (1995).

A. Construction of Plasmid pID408

A temperature-sensitive shuttle plasmid pID408 was constructed for usein transferring the signature-tagged transposons into S. aureus. DNArestriction digestions and ligations were performed as described bySambrook et al., Molecular cloning: a laboratory manual, 2nd ed., ColdSpring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989). A 6.5kb BamHI fragment of pTV32ts [described in Youngman, in Plasmids: apractical approach, K. Hard, ed., Oxford: IRL Press, pages 79-103(1985); obtained from J. Iandolo, Department of Diagnostic Medicine,College of Veterinary Medicine, Manhattan, Kans.] carrying thetemperature-sensitive replicon pE194ts and a chloramphenicol (cm)resistance gene (cm^(R)) was inserted into the BamHI site of plasmidpSP72 (Promega), forming pID402. Plasmid pID402 was digested with SmaIand self-ligated to remove a small fragment containing a BamHI sitebetween the SmaI sites, resulting in pID405. A DNA fragment of pBR322(New England BioLabs) containing the ampicillin resistance gene(amp^(R))/ori/rop region (corresponding to bases 1904-4359 of pBR322)was prepared by polymerase chain reaction (PCR) amplification usingprimers pBR1 (5′-GGAGCTCACTAGTCGGAGGCATCAGTGACC-3′, SEQ ID NO: 95) andpBR2 (5′-GGGATCCCATGAGAATTCTTGAAGACG-3′, SEQ ID NO: 96). Primer pBR1contains sites for SacI and SpeI, and pBR2 contains sites for BamHI andEcoRI. The PCR product was digested with BamRI and SacI and ligated tothe BamHI/SacI-digested fragment of pID405 (carrying replicon pE194tsand cm^(R)) to create pID406, in which the amp^(R)/ori region of pSP72is replaced with the replicon of pBR322. Finally, a 6.5 kb BamHI-EcoRIfragment from pTV9 [described in Youngman, in Plasmids: a practicalapproach, K. Hard, ed., Oxford: IRL Press, pages 79-103 (1985)],containing transposon Tn917 and its erythromycin (erm) resistance gene(erm^(R)), was ligated with BamHI and EcoRI digested pID406, resultingin plasmid pID408. The origin of replication from pBR322 allows pID408to replicate in E. coli, and the temperature-sensitive replicon pE194ts(from pTV32ts) allows replication in S. aureus at 32° C.

B. Cloning and Selection of Signature Tags

DNA signature tags were prepared as described by Hensel et al., Science,269:400-403 (1995) and inserted into the transposon Tn917 carried bypID408 as follows. Oligonucleotides tags in pool RT1(5′-CTAGGTACCTACAACCTCAAGCTT-[NK]₂₀-AAGCTTGGTTAGAATGGGTACCATG-3′, SEQ IDNO: 97, where N is A, C, G or T, and K is G or T were synthesized on anoligonucleotide synthesizer (Applied Biosystems). The 80 bp tags fromoligonucleotide pool RT1 were PCR-amplified from the using primers P10(5′-CTAGAATTCTACAACCTCAAGCTT-3′, SEQ ID NO: 98) and P11(5′-AAGCTTGGTTAGAATGGAATTCATG-3′, SEQ ID NO: 99). The PCR-amplified tagswere digested with EcoRI, gel-purified, and ligated with EcoRI digested,dephosphorylated pID408, to form plasmids containing uniquelysignature-tagged Tn917 transposons. The ligated DNA was transformed intoS. aureus strain RN4220 (a restriction defective strain derived fromNCTC 8325-4 and described in Youngman, in Plasmids: a practicalapproach, K. Hard, ed., Oxford: IRL Press, pages 79-103 (1985) byelectroporation. Electroporation was performed by the method of Schenkand Laddaga, FEMS Microbiol. Lett., 94:133-138 (1992), with thefollowing modifications: electrotransformed bacteria were plated onnon-selective brain-heart infuision (BHI, Difco) agar plates (1.5%agar), grown at 32° C. for 8 to 12 hours and then replicated using anAccutran Replica Plater (Schleicher and Schuell) onto BHI agar platescontaining 20 μg/ml erm and 20 μg/ml cm, and grown at 32° C. overnight.

A total of 400 erm^(R), cm^(R) transformants were then screened toidentify 150 tags that amplified and labelled efficiently. The 150transformants were each cultured in 10 ml BHI broth containing 20 μg/mlerm and 20 μg/ml cm at 32° C. overnight. Plasmid DNA was extracted fromeach culture using a Qiagen Plasmid Miniprep Kit according to themanufacturer's protocol except that the bacterial cells were lysed bylysostaphin (Sigma; 200 mg/ml) at 37° C. for 30 min. in the P1 solutionof the Qiagen miniprep kit. An aliquot of each plasmid DNA preparation(1 μg) was transferred onto a Hybond N⁺ membrane (Amersham) by using aBio-Dot Microfiltration Apparatus according to the manufacturer'sprotocol (Bio-Rad) to give 75 different plasmids per membrane. Themembranes were then transferred to Whatman 3MM paper saturated withdenaturing solution (0.5 N NaOH, 1.5 M NaCl), incubated for 5 min. thentransferred to another piece of Whatman 3MM paper saturated withneutralizing solution (1.5 M NaOH, 0.5 M Tris-Cl pH 7.4) for 5 min.After neutralization, the membranes were dried at 80° C. for 10 min andthe DNA cross-linked by UV light using a Stratalinker (Stratagene). Thesignature tags from the 150 transformants were also labelled for use asprobes. Aliquots of the plasmid preparations were combined to form fourpools of 37, 37, 38 and 38 plasmids. The DNA signature tags from thesefour pools were PCR-amplified with primers P12 (5′-GAATTCCATTCTAAC-3′,SEQ ID NO: 100) and P13 (5′-ATTCCATTCTAACCAAGC-3′, SEQ ID NO: 101)according to Hensel et al., Science, 269:400-403 (1995). These PCRproducts were gel-purified, subjected to a digoxygenin (DIG)-labellingPCR using a PCR DIG Probe Synthesis Kit as described by the manufacturer(Boehringer Mannheim). The DIG-labelled PCR products were hybridizedseparately with the membranes according to the method described in theDIG Probe Synthesis Kit, to identify tags that gave strong hybridizationsignals, but which did not cross-hybridize with tags in the other pools.From these experiments, a total of 96 plasmids were chosen on the basisof labelling efficiency of tags and lack of cross-hybridization.

C. Generation of the S. aureus Mutant Bank

The 96 plasmids containing uniquely signature tagged-transposons wereused to generate S. aureus mutants with a chromosomal transposoninsertion. A series of identical membranes for dot-blot hybridizationswere prepared by transferring 1 μg of each of the 96 plasmids ontoHybond N⁺ membranes using the Bio-Dot Microfiltration Apparatus. The 96plasmids were separately transformed into S. aureus strain RN6390 byelectroporation [Schenik and Laddaga, FEMS Microbiol. Lett., 94:133-138(1992)]. Electrotransformed bacteria were plated directly on BHI agarplates containing 20 μg/ml cm and 20 μg/ml erm at 32° C. overnight. Asingle colony from each transformation was transferred into each well ofa microtitre dish containing 200 μl BHI broth, 20 μg/ml erm and 20 μg/mlcm. The microtitre dish was incubated at 32° C. overnight. Glycerol wasadded to each well to give a final concentration of 50% and the plateswere stored at 80° C. This collection of 96 transformants was designatedthe master pool and was used for all subsequent mutagenesis.

To generate 96 different Tn917 mutants, bacteria from the master poolwere replicated using a microtitre dish replicator (Sigma) into thewells of a new microtitre dish containing 200 μl BHI broth and 20 μg/mlerm. This dish was incubated at 43° C. overnight and then bacteria fromeach well were streaked on BHI agar containing 20 μg/ml erm andincubated at 43° C. overnight to obtain single colonies. Approximatelyfive different erm^(R) colonies obtained from each well were transferredonto BHI agar containing 20 μ/ml cm and incubated at 32° C., and ontoBHI agar containing 20 μg/ml erm and incubated at 43° C., to screen forcm^(S) colonies. Erm^(R), cm^(S) colonies should lack the plasmid andcarry a chromosomal insertion of Tn917. Individual erm^(R), cm^(S)colonies from each of the 96 wells were transferred into a new 96 wellmicrotitre dish with BHI broth containing 20 μg/ml erm and grownovernight at 43° C. for use as inoculum. These 96 mutants, takentogether, form one “inoculum pool.” Mutant pools were also stored at−80° C. in 50% glycerol.

To verify that erm^(R), cm^(S) colonies lack the plasmid and carry asingle chromosomal insertion of Tn917, chromosomal DNA samples fromindividual mutant strains and from a pool of 46 colonies that had beencultured separately were digested with EcoRI and subjected to Southernanalysis using a fragment of the β-lactamase gene as a probe. For eachof 12 separately analyzed mutants, a single hybridizing fragment ofdifferent size was observed in each lane. A large number of hybridizingfragments were observed in the lane containing DNA from 46 mutants.These results indicate that the majority of Tn917 insertions occursingly and at different locations in the S. aureus chromosome.

D. Infection Studies to Identify Mutants with Reduced Virulence

The “inoculum pool” of 96 S. aureus mutants containing chromosomaltransposon insertions was evaluated for attenuated virulence in a mousemodel of bacteremia. After overnight incubation at 43° C., the 96different mutants from individual wells of the microtitre dish werepooled together and washed twice with BHI broth by centrifugation at4000×g for 10 min and resuspended in BHI broth. The OD₆₂₀ was determinedusing a spectrophotometer (OD₆₂₀ of 1.6×10⁹ colony forming units(CFU)/ml) [Cheung et al., J. Bacteriol, 177:3220-3226 (1995).]. Thebacterial suspension was diluted to approximately 5×10⁶ CFU/ml and thenmixed with an equal volume of pre-autoclaved 4% (w/v) Brewer's yeast(Sigma) in BHI broth. A total of 0.2 ml of this mixture, containingapproximately 5×10⁵ CFU bacteria, was injected intraperitoneally into aCD-1 mouse (approximately 25 g in weight). The number of CFU in theinoculum was verified by viable counts after plating a diluted aliquotof the inoculum to BHI agar. Two to four mice were inoculated with eachpool. 48 hours after inoculation, bacteria were recovered from thespleens of the animals as described by Hensel et al., Science,269:400-403 (1995) and pooled to form the “recovered pool.” Eachrecovered pool was made from at least 10,000 bacterial colonies. Thesignature tags present in the recovered pools were compared with thesignature tags present in the inoculum pools by PCR amplifying the tagsusing primers P12 and P13 (described in section B above), DIG labellingthe tags, and hybridizing the labelled tags to the 96 plasmids which hadbeen transferred to Hybond N⁺ membranes using the Bio-DotMicrofiltration Apparatus as described above in section C.

E. Virulence Gene Identification and DNA Sequencing

A total of 13 pools, each comprising 96 S. aureus Tn917 mutants, werescreened as described above for loss of virulence in mice. From these,50 mutants were identified whose tags hybridized strongly to probes fromthe inoculum pools but weakly to probes from the corresponding recoverypools.

To clone the chromosomal DNA flanking the transposon insertion points ofthese 50 mutants, 3-5 μg of S. aureus chromosomal DNA from each mutantwas isolated as described by Pospiech and Neumann, Trends Genetc.,11:217-218 (1995) and completely digested with HindIII. Half of thedigested DNA was then subjected to Southern hybridization analysis usingthe pBR322 fragment of pID408 as a probe to determine the size of theDNA fragment carrying this part of the transposon. The rest of the DNAwas resuspended in 200 μl of ligation buffer (Gibco-BRL) andself-ligated overnight at 16° C. The ligated products were transformedinto E. coli DH5α (Gibco) and plated onto Luria Bertani (LB) agarcontaining 50 μg/ml ampicillin, and incubated at 37° C. overnight. Asingle amp^(R) colony from each transformation was grown up in LB brothcontaining 100 μg/ml amp. Plasmid DNA was extracted using a Qiagenplasmid miniprep kit. Chromosomal DNA flanking the transposon wasobtained using primer pseq-1 (5′-TGAACTGCCACTGTAGAGAGA-3′, SEQ ID NO:102) based on the erm-proximal end of Tn917, and sequenced (for astretch of several hundred nucleotides) using a Model 373A SequencingSystem (Applied Biosystems). The sequence obtained is indicated in Table1 below with reference to the corresponding nucleotides of the SEQ IDNO:.

These DNA sequences were analyzed by searching the S. aureus databasefrom Human Genome Sciences and the European Molecular Biology Laboratoryand Genbank DNA and Protein Databases using the BLAST and FASTA networkservice at the Human Genome Mapping Project Recourse Centre, Hinxton,UK. Results of the searches done to date in identifying the virulencegenes and their possible function are shown in Table 1. Table 1 belowdisplays the signature tag identification number, the SEQ ID NO:corresponding to each virulence gene (and the nucleotide positionswithin the SEQ ID NO: that correspond to the actual stretch of DNA thatwas sequenced), the possible function of the gene, and the LD₅₀ of theknockout mutant as determined in Example 2 below.

EXAMPLE 2 Assessment of Virulence of S. aureus Knockout Mutants

The virulence of individual knockout mutants, identified in Example 1 ashaving reduced virulence when tested in pools of 96 mutants, wasassessed by carrying out an LD₅₀ determination for each mutant.Confirmation that a given mutation is in a virulence gene may beobtained by comparing the parent and mutated bacterial strains in termsof whether both strains are equally effective in establishing aninfection with the same consequences for the host animal. In practice,this may be done by comparing LD₅₀ values (the number of bacteriarequired to produce a 50% mortality rate in the animals understandardized conditions) between the wild-type strain and the mutantderivative strain. If the LD₅₀ values are within standard error of eachother then the mutation is not in or does not affect a virulence gene.If significant differences occur between the wild-type and the mutantstrain, where the mutant strain is significantly less able to causelethal infection, then the mutation is in a virulence gene ordramatically affects a virulence gene.

Briefly, LD₅₀ determinations were conducted as follows. CF1 female miceweighing between 19 and 22 grams were injected intraperitoneally with aset concentration of either wild-type or mutant S. aureus in 0.2 ml ofBHI broth containing 4% (w/v) dried brewer's yeast as an adjuvant forestablishment of infection. The amount of brewer's yeast may vary from0-8% dependent upon the bacteria being utilized in the infection. Asingle LD₅₀ determination utilizes five log-dilutions of bacteria (10³to 10⁷), with each log-dilution being tested in 10 mice. The actualnumber of bacteria employed was estimated in each LD₅₀ determination bya plate count conducted on the bacterial stock to determine the numberof CFU of bacteria. Following infection, the mice were monitored dailyfor mortality for a period of at least one week. At the end of theobservation period the LD₅₀ was determined using probit analysis on themortality data. The LD₅₀ value of the wild-type S. aureus strain RN6390ranges from 1.4×10⁴ to 1.4×10⁵. The LD₅₀ values for the virulence genemutants are shown in Table 1 below.

TABLE 1 VIRULENCE GENE SEQUENCES AND POSSIBLE FUNCTION SEQ PositionIDENTI- ID of DNA FICATION SIGNA- NO: sequence OF LD₅₀ of TURE (DNA;obtained POSSIBLE knockout TAG amino in FUNCTION mutant ID acid) Example1 OF GENE (#CFU)* p2c73 SEQ ID 517-756 FemA/B like with >6.8 × 10⁸ NO:1; 2 ⁻40% identity p2c90 SEQ ID 281-431 Unknown; near NO: 3 Opp operonp9b74 SEQ ID 593-711 Tryptophan synthase NO: 4; 5 alpha chain TrpAp11c29 SEQ ID 1107-1260 FemB NO: 6; 7 p13c83 SEQ ID 1107-1260 FemB  3.9× 10⁵ NO: 6; 7 p4c15 SEQ ID 360-654 MarR/LuxR-like NO: 8; regulatory 9 &10 protein p6c63 SEQ ID 388-659 MarR/LuxR-like >5.4 × 10⁸ NO: 8;regulatory 9 & 10 protein p5c4 SEQ ID 192-481 41% identity to NO: 11;nitrate reductase 12 NirQ p9b66 SEQ ID 221-474 Peptide methionine  4.0 ×10⁴ NO: 13; sulfoxide 14 reductase p10c15 SEQ ID 562-825Oligopeptide >2.6 × 10⁸ NO: 17 & transporter 19; OppD 18 & 20 p13b74 SEQID all Possible stem-loop  8.8 × 10⁴ NO: 21 termination sequence; noobvious ORF p13c72 SEQ ID all Possible stem-loop NO: 21 terminationsequence; no obvious ORF p14c15 SEQ ID  669-1009 C-term of aspartokinaseNO: 22; 2 alpha subunit and 23 & 24 N-term of aspartate semialdeydedehydrogenase Asd; both involved in homoserine synthesis p13b26 SEQ ID442-819 homoserine kinase  7.7 × 10⁴ NO: 25; 26 p7c18 SEQ ID 667-847diaminopimelate NO: 27; decarboxylase 28 LysA; lysine synthesis p15c31SEQ ID 1594-2018 phosphoribosylformyl- NO: 29; glyinamide 30decarboxylase PurL; purine synthesis p10b18 SEQ ID  3-404 tryptophansynthase >5.2 × 10⁸ NO: 31; alpha chain 32 trpA p6b18 SEQ ID  3-404tryptophan synthase NO: 31; alpha chain 32 trpA p10b66 SEQ ID  30-282tryptophan synthase >4.2 × 10⁶ NO: 33; beta chain trpB 34 p10c34 SEQ ID609-817 Anthranilate   >4 × 10⁶ NO: 35; phosphoribosyl 36 transferaseTrpD; tryptophan synthesis p4c27 SEQ ID 1130-1254 dihydrolipoamide NO:37; succinyl transferase 38 component(e2) of 2-oxoglutaratedehydrogenase complex in TCA cycle p4c52 SEQ ID 498-738 dihydrolipoamideNO: 39; succinyl transferase 40 component(e2) of 2-oxoglutaratedehydrogenase complex in TCA cycle p10b2 SEQ ID  880-1159dihydrolipoamide >2.6 × 10⁸ NO: 41; succinyl transferase 42component(e2) of 2-oxoglutarate dehydrogenase complex in TCA cyclep10c20 SEQ ID all Sequence 3′ to  3.7 × 10⁵ NO: 43 the dihydrolipoamidesuccinyl transferase component(e2) of 2-oxoglutarate dehydrogenasecomplex in TCA cycle; no obvious ORF p12c32 SEQ ID 188-438Alpha-ketoglutarate >1.0 × 10⁶ NO: 44; dehydrogenase 45 (e1) of the2-oxoglutarate dehydrogenase complex in TCA cycle; acetyl CoA synthasep10b30 SEQ ID 2419-2574 Heat shock protein  1.4 × 10⁴ NO: 46; ClpB 47p13c3 SEQ ID 454-765 35% identity to  1.8 × 10⁴ NO: 48; hypothetical45.9 49 kDa protein from B. subtilis ImpB/MucB/ SamB family p4b3 SEQ ID 790-1203 Unknown; homology to NO: 50; hypothetical proteins 51 from B.subtilis YAAD and YAAE p4c63 SEQ ID 576-806 Unknown; metallo- NO: 52;peptidase motif 53 p5c3 SEQ ID  1-450 Unknown; membrane  7.1 × 10⁴ NO:54; transporter motif 55 p8d26 SEQ ID  32-262 Unknown; 38% NO: 56 &identity to 58; 57 & C. elegans 59 ORF p9b65 SEQ ID  1-361sodium/proton >9.2 × 10⁸ NO: 60; dependent alanine 61 carrier proteinp10b32 SEQ ID 205-296 Unknown NO: 62; 63 p10b85 SEQ ID  38-212 UnknownNO: 64 p10b89 SEQ ID  1-205 Unknown; prokaryotic >2.2 × 10⁸ NO: 65;membrane lipoprotein 66 lipid attachment motif p10c30 SEQ ID  730-1041Unknown; sensor-type  8.8 × 10⁵ NO: 67; protein motif 68 p10c52 SEQ ID 1-262 Unknown NO: 69; 70 p10d9 SEQ ID 422-639 Unknown; enterotoxin- NO:71; like motif 72 p11c12 SEQ ID  1-150 Unknown; 40% identity >4.2 × 10⁶NO: 73; to Mycoplasma mycoides 74 hypothetical protein in ffh 5′ regionq01444 p11c66 SEQ ID 292-529 Unknown; near Opp NO: 75 & operon 77; 76 &78 p5c34 SEQ ID 1180-1236 glycine betaine NO: 79; transporter; region 8055% identity to B. subtilis p54417 p10c18 SEQ ID  1-477Unknown-potential NO: 81 ORF in 120 bp p12c3 SEQ ID  1-318 acetyl-CoAsynthetase; NO: 82; region 59% 83 identity to B. subtilis p39062 p14b25SEQ ID  1-148 Unknown; exonuclease- NO: 84; like; 25% identity to 85 E.coli p13458; contains ATP binding motif p14b74 SEQ ID 243-624 Unknown;51% identity NO: 86; to hypothetical 87 45.9 kd protein (YQJW) from B.subtilis p54560 p14c13 SEQ ID  1-441 Unknown; near OPP NO: 88 operonp15b9 SEQ ID 654-867 Unknown; 41% identity NO: 89; to hypothetical 9033.7 kd protein (YHCT) from B. subtilis p54604 p15b32 SEQ ID 286-370Unknown; similar to NO: 91; orf1 5′ of 92 acvB. of Agro. tumefaciensa36922 p15c4 SEQ ID  25-192 4-oxalocrotonate NO: 93; tautomerase; 42% 94identity to Pseudo. putida a43397 *The LD₅₀ for wild-type S. aureusRN6390 is normally 1.4 × 10⁴ to 1.4 × 10⁵.

EXAMPLE 3 Use of Virulence Gene Products in Screen for Anti-bacterialAgents

The virulence genes and their gene products are utilized in assays foridentifying new anti-bacterial agents against S. aureus. The genes arecloned, the proteins encoded by the genes are produced and purified,high throughput assays are established to screen for inhibitors, and theinhibitors identified in the primary screen are evaluated in secondaryassays. The cloning and expression of FemA, and its use in screeningpotential inhibitory agents, is described below.

A. Cloning of FemA Gene

DNA restriction digestions and ligations are performed as described bySambrook et al., Molecular cloning: a laboratory manual. 2nd ed., ColdSpring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989). Cloningof the S. aureus femA gene [Berger-Bachi et al., Mol. Gen. Genet.,219:263-269 (1989)] for intracellular expression in E. coli is carriedout as follows. To clone the gene with six histidine residues at the3′-terminus, the following two oligonucleotides primers,5′-GGCCATCGATAATGAAATTAATTAACGAGAGACAAATAGG-3′, SEQ ID NO: 103, and5′-GGCCGGATCCCTAGTGATGGTGATGGTGATGAAAAATTCTGTCTTTAACTTTTTT-3′, SEQ IDNO: 104, are used to PCR amplify the gene from S. aureus genomic DNA.The resulting femA gene has a ClaI site at 5′-end and a BamHI site at3′-end for cloning into E. coli expression vectors. For secretion of thefemA gene with six histidine residues at the 3′-terminus into theperiplasm of E. coli, PCR amplification using the following twooligonucleotide primers, 5′-GGCCGGTACCAAATTAATTAACGAGAGACAAATAGG-3′, SEQID NO: 105 and5′-GGCCGGATCCCTAGTGATGGTGATGGTGATGAAAAATTCTGTCTTTAACTTTTTT-3′, SEQ IDNO: 106, results in a fragment with a KpnI site at the 5′-end and aBamHI site at the 3′-end. The approximately 1.3 kb PCR fragment may besequenced directly, or after cloning into an expression vector, toconfirm the sequence of the femA gene and its flanking regions.

B. Recombinant FemA Protein Production and Purification

Expression of femA in E. coli is carried out as follows. The E. coliexpression vector pSRtac-pelB contains the synthetic tac promoter andthe signal sequence of PelB [Lei et al., J. Bact., 169:4379-4383(1987)], with transcription terminators upstream and downstream tominimize transcription read through. For intracellular expression, thefemA gene generated by PCR as described above in section A with ClaI andBamHI sites at the termini is cloned into pSRtac-pelB digested with ClaIand BamHI (replacing the PelB signal sequence). For secretion intoperiplasm, the femA gene generated by PCR as described above in sectionA with KpnI and BamHI sites at the termini is cloned into pSRtac-pelBdigested with KpnI and BamHI, directly downstream from the pelB signalsequence. E. coli are transformed with the expression vector containingthe femA gene. To induce the tac promoter and optimize expression level,the transformed bacteria are incubated with IPTG(isopropylthio-β-galactoside) at concentrations ranging from 2×10⁻³ M to1×10⁻⁵ M, at cell growth temperatures between 27° C. to 42° C. in richmedium or minimal medium.

The recombinant FemA thus produced contains six histidine residues atthe C-termninus and can be purified via the affinity of histidine tonickel. Chromatography may be carried out with the metal chelating resinNitrilotriacetic acid (Ni-NTA) (available from Qiagen, Chatsworth,Calif.) under native or denaturing conditions.

C. High Throughput Assays Using FemA for Screening Potential Inhibitors

The screening method described in U.S. Pat. No. 5,585,277 is used asfollows to establish a high throughput assay to identify ligands thatbind to FemA. The test ligand and femA are combined under conditions(e.g., temperature, pH, salt concentration, time) appropriate fordetecting binding of femA to a ligand. If too little target protein isunfolded, the observed signal will occur at too low a level or rate tobe conveniently measured. The conditions are optimized using knownmethods. Binding of a test ligand to femA is assessed in one of severalways: by determining the extent to which folded femA is present in thetest ligand-femA combination, by determining the extent to whichunfolded femA is present in the test ligand-femA combination, or bydetermining the ratio of folded femA to unfolded femA in thecombination. There are numerous methods to carry out thesedeterminations. For example, proteolysis may be used. A protease whichacts preferentially on unfolded femA is combined with the testligand-femA combination, and after an appropriate period of incubation,the difference between intact or degraded FemA in the presence and inthe absence of the test ligand is determined. The addition of a testligand which binds the folded femA, thus stabilizing it in theprotease-resistant form, changes the rate of proteolysis. A wide varietyof known proteases, such as trypsin, chymotrypsin, V8 protease,elastase, carboxypeptidase, proteinase K, thermolysin and subtilisin,can be used. As another example, the binding of the test ligand to thefemA is assessed through the use of antibodies that specifically bind tothe protein only in the unfolded state. There are numerous methods knownin the art for producing antibodies to a particular protein.

D. In Vivo Assays to Evaluate Inhibitors

The inhibitors/binders identified by the initial screens such as the onedescribed in section C above are evaluated for their effect on virulencein mouse models of S. aureus infection. Mice are infected with 100LD₅₀'s of the wild-type bacteria and treated with varying doses of thetest inhibitory agent. The ED₅₀ of the test inhibitory agent (the amountin milligrams of drug per kilogram of body weight required to cure 50%of infected animals) is determined. Each trial contains two controlgroups, a negative control group given an LD₅₀ dose of bacteria and notest inhibitory agent (which ensures that the infecting dose isoperationally close to 100 LD₅₀'s), and a positive control group given a100 LD₅₀ dose of bacteria and an ED₅₀ dose of a known and effectiveantibiotic for that bacterial infection (which confirms that theinfection can be cured under appropriate conditions).

Five dosage levels of test inhibitory agent, utilizing serial two-folddilutions, are employed. An exemplary dosage scheme begins with aminimum of 6.25 mg of drug per kg of body weight per dose, and ranges upto a maximum dose of 100 mg/kg. Each dosage level is tested in six miceand deaths following infection and treatment are monitored for at least6 days. At the end of the test period, probit analysis is employed todetermine the ED₅₀ value of the inhibitor or the amount of drug inmilligrams of drug per kilogram of body weight required to reversevirulence so as to result in only 50% mortality of infected animals inthe test. In tests designed to detect the prevention of infection byinterference with virulence factors, the first dose of the inhibitor isgiven one hour prior to infection, and then four and eight hours postinfection. In tests designed to detect virulence inhibitors whichreverse the outcome of infection, the inhibitor is administered one andfour hours post infection on the first day of infection and once in themorning and afternoon (separated by 6-8 hours) on the second and thirddays following infection. Inhibitory agents that successfully preventthe establishment of an infection or reverse the outcome of an infectiononce established are identified.

EXAMPLE 4 Svr, a Virulence Regulator of Staphylococcus aureus

The identification of large number of putative virulence genes inStaphylococcus aureus using signature-tagged mutagenesis is reported inExamples 1 to 3. Many of these had no significant similarity tosequences in the DNA and protein databases. To investigate further therole of these genes in virulence, mutant strains were subjected to aseries of phenotypic tests. One mutant strain was found with pleiotropiceffects on virulence factors. We thus designated the mutated gene inthis strain staphylococcal virulence regulator (svr). Compared with itsparent wild-type strain, the svr mutant strain expressed greatly reducedamounts of α-, β- and δ-toxins but increased amounts of protein A. DNAsequence analysis of the cloned svr gene did not reveal any significantsimilarities to entries in the DNA and protein databases. Northernhybridization with probes specific for hla, the gene encoding α-toxin,and spa, the gene encoding protein A, showed that svr affected theexpression of α-toxin and protein A at the mRNA level. Northern analysisof svr, agr and RNAIII transcripts showed that agr mRNA and RNAIII weredetectable in the wild-type strain and an svr mutant strain carrying thesvr gene on a plasmid but was absent in the agr⁻ and svr⁻ strains. Svrtranscripts were present in wild-type and agr⁻ strains but were notdetected in the svr⁻ strain. This indicates that svr is required for theexpression of agr and RNAIII. The svr product is therefore likely to bea novel compenent in the agr regulatory network controlling virulence ofS. aureus.

Methods

Bacterial Strains and Plasmids

Bacterial strains and plasmids are listed in Table 1. S. aureus strainswere grown in Brain Heart Infusion (BHI) medium (Difco) with or withoutagar (1.5%) and antibiotics (erythromycin [erm] at 20 μg/ml and/orchloramphenicol [cm] at 20 μg/ml). E. coli strain DH5a was grown inLuria Bertani (LB) medium with or without ampicillin [amp] at 50 μg/ml).

Table 1. Bacterial strains and plasmids used in this study

TABLE 1 Bacterial strains and plasmids used in this study Strain Sourceor or plasmid Phenotype or characteristics reference Strains E. coliF-supE44 ΔlacU169 (Φ801acZΔM15) Gibo BRL DH5α hsdR17 recA1 gyr thi-1 S.aureus RN4220 NCTC8325-4, restriction minus. Peng et al. 1988 RN6390Propagated laboratory strain Peng et al. 1988 RN6911 RN6390 Δagr. Penget al. 1988 P6C63 svr strain. selected by STM Examples 1 to 3 NewmanWild-type strain NCTC10833 Wood46 An α-hemolysin-producing strain.NCTC10345 ID401 Clinical isolate from Hammersmith This study Hospital,Mec^(r) ID402 Clinical isolate from Hammersmith This study Hospital,Mec^(r) Plasmids pBR322 Cloning vector, Amp^(r), Tc^(r). New EnglandBioLab pSP72 Cloning vector, Amp^(r). Gibco BRL pVA380-1 Cloning vectorof S. aureus. Tc^(r). Macrina et al. 1980 pCW59 Cloning vector, Tc^(r)Wilson et al. 1981 pID431 Screened from pBR322 S. aureus library. Thisstudy Contains svr, orf2 and orf3. pID413 Cloning vector, derived fromThis study pVA380-1, Tc^(r). pID437 pID413 carrying svr. This studypID439 pID413 carrying svr, orf2 and orf3. This study pID4311 pID413carrying svr and orf2 plus 287 bp This study upstream sequence of orf2pID4312 pID413 carrying svr, orf2 and orf3 plus This study 287 bpupstream sequence of orf2.

DNA Manipulations, PCR, Digoxygenin Labelling and Hybridizations

Chromosomal DNA from Staphylococcus aureus was isolated as described byPospiech et al., (8). DNA restriction digestions and modifications wereperformed as described by Sambrook et al., (9). Plasmid DNA from S.aureus strain RN4220 was isolated using a Qiagen Plasmid Miniprep Kitaccording to the manufacturer's protocol except that the bacterial cellswere lysed by lysostaphin (Sigma; 200 mg/ml) at 37° C. for 30 min priorto plasmid purification.

Construction of Plasmids

Plasmid pID413 was derived from pVA380-1(10). A 2.5 kb fragment carryingthe pVA380-1 replicon was PCR-amplified from pVA380-1. Restriction sitesfor BglII and HindIII were introduced in the 5′ end of the amplifiedfragment by using forward primer 5′-TGGAGATCTAAGCTTTGCATAACTTTCTCGTCC-3′(SEQ ID No 107) and reverse primer 5′-TCCTGGCGATTCTGAGAC-3′(SEQ ID No108). The amplified fragment was filled in with DNA polymerase Klenowfragment and ligated with a 2.3 kb DNA fragment carrying thetetracycline resistance gene from pCW59 after digestion by HindIII andfilling in with DNA polymerase Klenow fragment, resulting in plasmidpID413. A DNA polylinker of plasmid pSP72 was digested with BglII andHindIII and inserted into BglII and HindIII digested pID413 to generatepID413PL.

Construction of Genomic Library of S. aureus.

A S. aureus (RN6390) chromosomal DNA library was constructed in pBR322as follows: chromosomal DNA was partially digested with BamHI and EcoRIto an average size of 5 kb and purified by phenol:chloroform extraction.The purified DNA fragments were ligated with BamHI and EcoRI digestedpBR322. This ligation product was transformed into E. coli DH5a byelectroporation and plated on LB agar plate containing 50 μg/mlampicillin.

Complementation of Svr Mutant P6C63

Four DNA fragments with different sizes were cloned into pID413PL tocomplement mutant strain P6C63 (FIG. 1A). All of these four fragmentswere amplified by PCR using primers based on the DNA sequence flankingthe svr gene. Restriction sites for BamHI and HindIII were introduced inthe 5′ end and the 3′ end of each fragment, respectively. The amplifiedfragments were digested by BamHI and HindIII and inserted into BamHI andHindIII digested pID413PL to generate pID437, pID439, pID4311 andpID4312. The svr gene was amplified from genomic DNA of RN6390 by PCRusing primers 5′-TGGGGATCCGATAAGTGTGACTGGTAG-3′ (SEQ ID No 109) and5′-TGGAAGCTTACATTACTTCAAATAAATTA-3′ (SEQ ID No. 110) to generate pID437.A 2.1 kb fragment containing svr, orf2 and orf3 was amplified by PCRusing primers 5′-TGGGGATCCTGCATATCAAAATGTTTATGGC-3′ (SEQ ID No. 111) and5′-TGGAAGCTTACACATATGCCAATCTCAC-3′ (SEQ ID No. 112) to generate pID439.A 1.3 kb fragment containing svr and orf2 plus 287 bp upstream sequenceof orf2 was amplified by PCR using primers5′-GTTGGATCCGCTGTTGTTACTTTGATGC-3′ (SEQ ID No. 113) and5′-TGGAAGCTTACATTACTTCAAATAAATTA-3′ (SEQ ID No. 114) to generatepID4311. A 2.4 kb fragment containing svr, orf2, and orf3 plus 287 bpupstream sequence of orf2 was amplified by PCR using primers5′-GTTGGATCCGCTGTTGTTACTTTGATGC-3′ (SEQ ID No. 115) and5′-TGGAAGCTTACACATATGCCAATCTCAC-3′ (SEQ ID No 116) to generate pID4312.All of these four plasmids were individually transformed into P6C63 byelectroporation. Transformants were selected by resistance totetracycline and tested for restoration of wild-type phenotype.

Phenotypic Characterization

Lipase activity was assayed on 1% Tween agar plates (Difco). DNaseproduction was assayed on DNase agar (Difco). Coagulase was assayed bythe method described by Smeltzer et al. (11). α-, β- and δ-toxins wereassayed on cross-streaked sheep, rabbit and horse blood agar plates.Protein A was detected by Western blot using anti-protein A monoclonalantibody (Sigma).

Southern and Northern Blot Analysis

Southern hybridization analysis was performed as described by Sambrooket al (9) with DIG-labelling DNA fragments as probes. For Northernhybridization. total RNA from S. aureus was isolated by using QuiagenRNA easy kit according to the manufacturer's protocol except that thebacterial cells were lysed by lysostaphin (Sigma; 200 mg/ml) at 37° C.for 3-5 min. Equal amounts of RNA were separated on 1.2% agarose gelscontaining 0.66 M formaldehyde and transferred onto nitrocellulosemembrane. Hybridizations were carried out at 42° C. All probes wereradiolabelled with [(α-³²P]dATP by PCR amplification using PCR-generatedDNA fragments as templates. Oligonucleotide primers used for PCR were asfollows: amplification of an hla fragment was performed with primer H1(5′-ATTTGATATGTCTCAACTGC-3′) (SEQ ID No 117) and H2(5′-GCTCTAATTTTTAAGTGAGG-3′) (SEQ ID No 118). For amplification of spa,primers used were S1 (5′-TATCTGGTGGCGTAACACCTG-3′) (SEQ ID No 119) andS2 (5′-GATGAAGCCGTTACGTTGTTC-3′) (SEQ ID No 120). For agr, primers A1(5′-GCCATAAGGATGTGAATGTATG-3′) (SEQ ID No 121) and A2(5′-GCATTTGCTAGTTATCTTG-3′) (SEQ ID No. 122) were used. Primers R1(5′-AGATCTATCAAGGATGTGATGGTT-3′) (SEQ ID No 123) and R2(5′-GTCATTATACGATTTAGTACAATC-3′) (SEQ ID No 124) were used for theamplification of RNAIII.

Reverse Transcription-polymerase Chain Reaction (RT-PCR)

Total RNA (1 mg) from post-exponential phase cultures of bacteria wasreverse transcribed using First-strand cDNA synthesis kit (PharmaciaBiotech) according to the manufacturer's instructions. PCR was performedin a volume of 100 ml with 10 ml cDNA sample, 200 pmol of each primerand 200 nM of dNTPs and 2.5 U of Taq-DNA polymerase (Sigma). PCRproducts were analysed by agarose gel electrophoresis.

Infection Studies

For single strain infection studies, CD-1 female mice (20 g; CharlesRiver Labs) were individually injected intraperitoneally with 0.2 ml ofa suspension containing a 5×10⁵ cfu bacteria and 2% (w/v) Brewer's yeastin BHI broth. At 6, 24, 48, 72, 96 and 120 hours post-injection, twomice were killed from each group. Dilution series of spleen homogenateswere spread over BHI agar plates and incubated at 37° C. overnight. Thenumber of bacterial cfu were counted at each time point.

Results

Cloning and Sequencing of the Svr Region

To clone the chromosomal region surrounding the transposon insertionsite of strain P6C63, a genomic library of S. aureus was generated inplasmid pBR322. By hybridising colonies from this genomic library with aprobe consisting of a 0.5 kb DNA flanking the transposon insertion, onepositive clone was identified. Restriction analysis of this recombinantplasmid revealed a 2.5 kb AluI fragment which had been disrupted byTn917 in P6C63. SEQ ID Nos: 8 and 10 are the nucleotide and amino acidsequences initially identified for P6C63. Transposon Tn917 had insertedinto one of three open reading frames (orfs) in this fragment. This orfwas designated svr (FIG. 1A). It is 372 bp in length and codes for aprotein of 124 amino acid residues (FIG. 1B; SEQ ID No: 125). Uponfurther sequencing of P6C63 the nucleotide and amino acid sequences weredetermined as SEQ ID No: 126 and SEQ ID No: 127. FASTA and BLASTsearches of the protein databases revealed that svr and orf3 had nosignificant similarity to known proteins or motifs. However, orf2,immediately upstream of svr showed significant similarity (28% identityover 148 amino acid residues) to the Escherichia coli slyA, which is amember of the mar family of transcription factors of Bacillus subtilis.

To establish whether the virulence defect of strain P6C63 was due tointerruption of svr or a polar effect on orf3 or other genes of apossible operon, a series of plasmids was constructed and transformedinto P6C63 to test for complementation of defects in α-, β- and δ-toxinproduction (see below). The insert in the smallest complementing plasmid(pID437) contained only the svr gene (FIG. 1A), indicating that thetoxin-deficient phenotype is due to mutation of the svr gene. Whenplasmids pID437, pID439, pID4311 and pID4312 were transformed intoP6C63, only a proportion of transformants showed complete restoration ofthe wild-type phenotype. A Southern analysis was performed on thesetransformants to test for the presence of the plasmid in thecomplemented strains, using the svr gene as the probe. Southernhybridization showed that in the complemented transformants, the plasmidhad integrated into the chromosomal DNA (data not shown), whereas theuncomplemented transformants carried unintegrated plasmids.

To demonstrate whether other strains of S. aureus, particularly clinicalisolates also contain the svr gene, a Southern hybridization analysiswas performed on S. aureus strains RN6390, Newman, Wood 46, ID401 andID402, which were from different clinical sources in the USA and UK.Chromosomal DNA from each strain was digested with HindIII and probedwith the svr gene. A common 8.2 kb fragment was observed in strainRN6390, Newman, Wood 46 and ID401, whereas a 10 kb fragment was observedin strain ID402 (FIG. 1C). This indicates that svr is widely conservedin S. aureus.

Phenotypic Characterization

To investigate the svr⁻ phenotype in more detail, P6C63 was subjected toa number of tests for S. aureus virulence determinants. The expressionlevels of a-, b- and d-toxins were examined on different blood agarplates. The expression levels of a-, b-, and d-toxins are greatlyreduced in P6C63 and RN6911 (an agr⁻ strain) compared with theirparental strain RN6390, and the production of toxins was restored inP6C63 when transformed with pID437 (FIG. 2A).

A characteristic of agr⁻ strains is the overproduction of cell wallprotein A. In view of the similar toxin-deficient phenotype of agr⁻ andsvr⁻ strains, we examined protein A by Western blot using ananti-protein A monoclonal antibody. As shown in FIG. 2B, protein A wasnot detectable in the wild-type strain, was barely detectable in thecomplemented strain, and was present in both agr⁻ and svr⁻ strains.

Virulence Studies

Mutant P6C63 was originally isolated by STM in a pool of 96 mutants andLD₅₀ analysis has shown that it is highly attenuated in virulencecompared with the wild-type strain RN6390 (7). To study the growthkinetics of P6C63 in more detail, strains RN6390 and P6C63 were injectedintraperitoneally into groups of CD-1 mice at a dose of 1×10⁵ cfu ofbacteria. Bacteria were recovered from spleens at different time pointsand cfu quantified. The bacterial load of RN6390 in spleens exceeded 10⁶cfu at 6, 24, 48 and 72 hours post-inoculation. The numbers of P6C63cells increased over the first 6 hours, then decreased, and wereeventually cleared by 96 hours. Mice inoculated with RN6390 appearedvery sick compared with those injected with the mutant strain and one of18 mice died in 24 hours.

Transcriptional Analysis of Svr

As attempts to detect svr mRNA in RN6390 by Northern hybridization werenot successful, RT-PCR was employed to determine the transcription ofsvr in various genetic backgrounds. Total RNA from P6C63 (svr⁻), RN6911(agr⁻) and RN6390 (wild-type) were subjected to RT-PCR using primerscorresponding to the svr DNA sequence. As shown in FIG. 4A, svrtranscripts were detected in RN6911 and RN6390 but not in P6C63. Acontrol RT-PCR was performed by using same conditions as above exceptthat reverse transcriptase was inactived at 95° C. for 5 min. No PCRproducts were obtained for any of the three strains, indicating that theproducts shown in FIG. 4A were not due to DNA contamination.

Effect of SVR on Transcription of hla and spa.

As the svr mutant strain showed reduced production of α-, β- andδ-toxins and increased levels of protein A, Northern hybridizations wereperformed to determine whether they were affected at the mRNA level.Total RNA extracted from post-exponential phase cultures of RN6390,RN6911, P6C63 and the complemented strain P6C63 (pID437) were subjectedto Northern analysis using probes specific for hla and spa. With the hlaprobe, a strong band was observed in RNA from the wild-type strainRN6390 and the complemented strain but was not observed in either theagr⁻ strain RN6911 or svr⁻strain P6C63. The membranes were stripped andreprobed with spa gene, and in contrast to hla, a hybridizing band wasidentified in RNA from RN6911 and P6C63 but not in strain RN6390 or thecomplemented strain (FIG. 4B). These results are consistent with theprotein analysis (above) and indicated that the svr affected theexpression of α-toxin and protein A at the mRNA level.

Svr is Required for the Transcription of Agr and RNAIII

Since phenotypic analysis and Northern hybridization results indicatedthat the svr mutant has similar characteristics to those of anagr⁻strain, Northern hybridizations were performed to investigatewhether svr is related to the agr regulatory system. Total RNA isolatedfrom postexponential phase cultures of RN6390, RN6911, P6C63 and thecomplemented strain was subjected to Northern hybridizations usingprobes specific for agr and RNAIII. As shown in FIG. 4C, both the agrand RNAIII probes hybridized to RNA isolated from the wild-type strainand complemented strain, but did not hybridize to RNA from the svrmutant strain or the agr⁻ strain. This result suggests that svr isrequired for the transcription of agr and RNAIII. Using STM, an approachfor the identification of bacterial virulence genes (7), we haveidentified svr, a novel staphylococcal virulence regulator whose mutantphenotype is similar to that of an agr⁻ strain. However, DNA sequenceanalysis of svr shows that it not a member of the known agr regulatorysystem, and has no similarity to known proteins in the DNA and proteindatabases. The svr⁻ mutant phenotype was complemented by a plasmidcontaining svr indicating that the phenotypic defect was not due to apolar effect on genes downstream of svr. Interestingly, complementationoccurred in cells containing chromosomally integrated plasmid, but notin cells containing non-integrated plasmids. The reason for this isunclear, but as the plasmid copy number is in the order of 15-25/cell(10), it could be related to over-expression of svr. Certainly theabundance of svr mRNA seems to be very low in wild-type cells, as it wasnot detectable by Northern hybridization using RNA recovered frombacteria at various stages of growth.

The phenotypic and Northern hybridization analysis of toxins and proteinA in the svr mutant showed it is similar pattern to that of an agr⁻mutant strain. We therefore asked whether svr is linked with agrregulatory system. The svr mRNA was detected in an agr⁻ strain andwild-type strain but not in the svr⁻ strain, suggesting that agr has noeffect on svr activity. By contrast, agr mRNA and RNAIII were detectedin the wild-type strain and svr⁻-complemented strain, but not in thesvr⁻ strain. This suggests that svr is required for the expression ofagr and RNAIII.

It was recently shown that agr is autoinduced by a proteinaceous factorcalled RAP (RNAIII activating protein), which is produced and secretedby S. aureus. (16). Although the sequence of RAP has not been published,it seems unlikely that svr encodes the RAP protein, because the lengthof svr is 372 bp, which could encode a 13 kDa protein, and RAP is a 38kDa protein (16). The fact that sar is also necessary for the expressionof agr and RNAIII (6) indicates that three distinct regulatory pathwayscould influence the expression of the agr locus and virulence factors ofS. aureus. Studies on the relationship between svr, sar and RAP willhelp to clarify the mechanisms by which S. aureus controls virulencegene expression.

The recent emergence of methicillin resistant S. aureus (MRSA)represents a very important public health problem (18). Thus there is anurgent need for alternative approaches to control S. aureus infections.Balaban et al. (17) reported that mice vaccinated with RAP wereprotected from infection by a subsequent S. aureus challenge. The factthat svr is required for virulence, most likely through regulation ofagr and RNAIII, suggests that svr might be another target for thedevelopment of antibiotics and vaccines against S. aureus infections.

REFERENCES FOR EXAMPLE 4

1. Sherris, J. C. and Plorde, J. J. (1990) Staphylococci. In MedicalMicrobiology: An Introduction to Infectous Diseases, ed. Sherris, J. C.(Elsvier Science Publishing Co. Inc., New York), pp. 275-289.

2. Peng, H.-L, Novick, R. P., Kreiswirth, B., Kornblum, J. andSchlievert, P. (1988) J. Bacteriol 170: 4365-4372

3. Cheung, A. L., Koomey, J. M., Butler, C. A., Projan, S. I., andFischetti, V. A. (1992) Proc Natl Acad Sci USA 89: 6462-4466.

4. Novick, R. P., S. J. Projan, J. Kronblum, H. F. Ross, G. Ji, B.Kreiswirth, F. Vandenesch and S. Moghazeh. (1995) Mol Gen Genet 248:446-458.

5. Novick, R. P., H. F. Ross, S. J. Projan, J. Kronblum, B. Kreiswirth,and S. Moghazeh. (1993) EMBO J. 12: 3967-3975.

6. Cheung A. L., M. G. Bayer and Heinrichs J. H. (1997) J. Bacteriol179:3963-3971.

7. Mei J-M., Nourbakhsh F., Ford C. W. and Holden, D. W. (1997) MolMicrobiol 26:399-407

8. Pospiech, A., and Neumann, B. (1995) Trends Genet 11: 217-218.

9. Sambrook, J., Fritsch, E. F., and Maniatis, T. (1989) Molecularcloning: a laboratory manual. 2nd ed. Cold Spring Harbor LaboratoryPress. Cold Spring Harbor. N.Y.

10. Macrina, F. L., P. H. Wood, and Jones K. R. (1980) Infect Immun28:692-699.

11. Srneltzer, M. S., Hart, M. E., Iandolo, J. J. (1993) Infect Immun61: 919-25 925.

12. Morfeldt, E., Tegmark, K & Arvidson, S. (1996) Mol. Microbiol.21:1227-1237.

13. Cheung, A. L., Bayer, M. G., and Heinrichs, J. H. (1997) J.Bacteiol. 179:3963-3971

14. Novick, R. P., H. F. Ross, S. J. Projan, J. Kronblum, B. Kreiswirth,and Moghazeh, S. (1993) EMBO J. 12: 3967-3975.

15. Schenk, S. and Laddaga, R. A. (1992) FEMS Microbiology Letters 94:133-138.

16. Balaban, N. and Novick, R. P. (1995) Proc. Natl. Acad. Sci. USA 92:1619-1623.

17. Balaban, N., Goldkorn, T., Nhan, R. T., Dang, L. B., Scott, S.,Ridgley, R. M., Rassooly, A., Wright, A. C., Larrick, J. W., Rasooly. R.and Carlson, J. R. (1998) Science 280: 438-440.

18. Mulligan M. E., K. A. Murray-Leisure, B. S. Ribner, H. C.Standiford, J. F. John, J. A. Korvick, C. A. Kauffinan and Yu V. L.(1993)z The American J. Med. 94:313-328.

Numerous modifications and variations of the above-described inventionare expected to occur to those of skill in the art. Accordingly, onlysuch limitations as appear in the appended claims should be placedthereon.

127 1260 base pairs nucleic acid single linear other nucleic acid /desc= “DNA (genomic) (p2c73)” CDS 1..1257 1 ATG AAA TTT ACA GAG TTA ACT GTTACC GAA TTT GAC AAC TTT GTA CAA 48 Met Lys Phe Thr Glu Leu Thr Val ThrGlu Phe Asp Asn Phe Val Gln 1 5 10 15 AAT CCA TCA TTG GAA AGT CAT TATTTC CAA GTA AAA GAA AAT ATA GTT 96 Asn Pro Ser Leu Glu Ser His Tyr PheGln Val Lys Glu Asn Ile Val 20 25 30 ACC CGT GAG AAT GAT GGC TTT GAA GTAGTT TTA TTA GGT ATT AAA GAC 144 Thr Arg Glu Asn Asp Gly Phe Glu Val ValLeu Leu Gly Ile Lys Asp 35 40 45 GAC AAT AAC AAA GTA ATT GCA GCA AGC CTTTTC TCT AAA ATT CCT ACT 192 Asp Asn Asn Lys Val Ile Ala Ala Ser Leu PheSer Lys Ile Pro Thr 50 55 60 ATG GGA AGT TAT GTT TAC TAT TCG AAT CGT GGTCCA GTA ATG GAT TTT 240 Met Gly Ser Tyr Val Tyr Tyr Ser Asn Arg Gly ProVal Met Asp Phe 65 70 75 80 TCA GAT TTA GGA TTA GTT GAT TAT TAT TTA AAAGAG TTA GAT AAA TAT 288 Ser Asp Leu Gly Leu Val Asp Tyr Tyr Leu Lys GluLeu Asp Lys Tyr 85 90 95 TTA CAG CAA CAT CAA TGT TTA TAT GTT AAA TTA GATCCG TAT TGG TTA 336 Leu Gln Gln His Gln Cys Leu Tyr Val Lys Leu Asp ProTyr Trp Leu 100 105 110 TAT CAT CTA TAT GAT AAA GAT ATC GTG CCA TTT GAAGGT CGC GAG AAA 384 Tyr His Leu Tyr Asp Lys Asp Ile Val Pro Phe Glu GlyArg Glu Lys 115 120 125 AAT GAT GCC CTA GTA AAC TTG TTT AAA TCA CAT GGTTAC GAG CAT CAT 432 Asn Asp Ala Leu Val Asn Leu Phe Lys Ser His Gly TyrGlu His His 130 135 140 GGC TTT ACA ACT GAG TAT GAT ACA TCG AGC CAA GTACGA TGG ATG GGC 480 Gly Phe Thr Thr Glu Tyr Asp Thr Ser Ser Gln Val ArgTrp Met Gly 145 150 155 160 GTA TTA AAC CTT GAA GGT AAA ACA CCC GAA ACATTG AAA AAG ACA TTT 528 Val Leu Asn Leu Glu Gly Lys Thr Pro Glu Thr LeuLys Lys Thr Phe 165 170 175 GAT AGT CAA CGT AAA CGT AAT ATT AAT AAA GCGATA AAC TAT GGT GTT 576 Asp Ser Gln Arg Lys Arg Asn Ile Asn Lys Ala IleAsn Tyr Gly Val 180 185 190 AAA GTC AGA TTC CTT GAA CGT GAT GAG TTC AATCTT TTC TTA GAT TTA 624 Lys Val Arg Phe Leu Glu Arg Asp Glu Phe Asn LeuPhe Leu Asp Leu 195 200 205 TAT CGT GAA ACT GAA GAG CGT GCT GGA TTT GTATCA AAA ACA GAT GAT 672 Tyr Arg Glu Thr Glu Glu Arg Ala Gly Phe Val SerLys Thr Asp Asp 210 215 220 TAT TTT TAT AAC TTT ATT GAC ACA TAT GGA GATAAA GTA TTA GTA CCA 720 Tyr Phe Tyr Asn Phe Ile Asp Thr Tyr Gly Asp LysVal Leu Val Pro 225 230 235 240 TTA GCA TAT ATT GAC CTT GAT GAA TAT GTGTTA AAG TTG CAA CAG GAA 768 Leu Ala Tyr Ile Asp Leu Asp Glu Tyr Val LeuLys Leu Gln Gln Glu 245 250 255 TTG AAT GAC AAA GAA AAT CGT CGT GAT CAAATG ATG GCG AAA GAA AAC 816 Leu Asn Asp Lys Glu Asn Arg Arg Asp Gln MetMet Ala Lys Glu Asn 260 265 270 AAA TCA GAT AAG CAA ATG AAG AAA ATT GCAGAA TTA GAT AAG CAA ATT 864 Lys Ser Asp Lys Gln Met Lys Lys Ile Ala GluLeu Asp Lys Gln Ile 275 280 285 GAT CAT GAT CAG CAT GAA TTA TTG AAT GCAAGT GAA TTG AGC AAA ACG 912 Asp His Asp Gln His Glu Leu Leu Asn Ala SerGlu Leu Ser Lys Thr 290 295 300 GAC GGC CCA ATT CTA AAC CTT GCT TCT GGCGTT TAT TTT GCA AAT GCA 960 Asp Gly Pro Ile Leu Asn Leu Ala Ser Gly ValTyr Phe Ala Asn Ala 305 310 315 320 TAT GAA GTG AAT TAT TTC TCT GGT GGTTCA TCA GAA AAA TAT AAT CAA 1008 Tyr Glu Val Asn Tyr Phe Ser Gly Gly SerSer Glu Lys Tyr Asn Gln 325 330 335 TTT ATG GGA CCA TAC ATG ATG CAT TGGTTT ATG ATT AAC TAT TGC TTC 1056 Phe Met Gly Pro Tyr Met Met His Trp PheMet Ile Asn Tyr Cys Phe 340 345 350 GAT AAT GGC TAT GAT CGT TAT AAT TTCTAT GGT TTA TCA GGT GAT TTT 1104 Asp Asn Gly Tyr Asp Arg Tyr Asn Phe TyrGly Leu Ser Gly Asp Phe 355 360 365 ACG GAA AAC AGT GAA GAT TAT GGC GTATAC CGC TTT AAA CGT GGA TTT 1152 Thr Glu Asn Ser Glu Asp Tyr Gly Val TyrArg Phe Lys Arg Gly Phe 370 375 380 AAT GTA CAA ATC GAA GAA TTA ATA GGGGAT TTC TAT AAA CCA ATT CAT 1200 Asn Val Gln Ile Glu Glu Leu Ile Gly AspPhe Tyr Lys Pro Ile His 385 390 395 400 AAA GTG AAA TAT TGG TTG TTC ACAACA TTG GAT AAA TTA CGT AAA AAA 1248 Lys Val Lys Tyr Trp Leu Phe Thr ThrLeu Asp Lys Leu Arg Lys Lys 405 410 415 TTA AAG AAA TAG 1260 Leu Lys Lys419 amino acids amino acid linear protein 2 Met Lys Phe Thr Glu Leu ThrVal Thr Glu Phe Asp Asn Phe Val Gln 1 5 10 15 Asn Pro Ser Leu Glu SerHis Tyr Phe Gln Val Lys Glu Asn Ile Val 20 25 30 Thr Arg Glu Asn Asp GlyPhe Glu Val Val Leu Leu Gly Ile Lys Asp 35 40 45 Asp Asn Asn Lys Val IleAla Ala Ser Leu Phe Ser Lys Ile Pro Thr 50 55 60 Met Gly Ser Tyr Val TyrTyr Ser Asn Arg Gly Pro Val Met Asp Phe 65 70 75 80 Ser Asp Leu Gly LeuVal Asp Tyr Tyr Leu Lys Glu Leu Asp Lys Tyr 85 90 95 Leu Gln Gln His GlnCys Leu Tyr Val Lys Leu Asp Pro Tyr Trp Leu 100 105 110 Tyr His Leu TyrAsp Lys Asp Ile Val Pro Phe Glu Gly Arg Glu Lys 115 120 125 Asn Asp AlaLeu Val Asn Leu Phe Lys Ser His Gly Tyr Glu His His 130 135 140 Gly PheThr Thr Glu Tyr Asp Thr Ser Ser Gln Val Arg Trp Met Gly 145 150 155 160Val Leu Asn Leu Glu Gly Lys Thr Pro Glu Thr Leu Lys Lys Thr Phe 165 170175 Asp Ser Gln Arg Lys Arg Asn Ile Asn Lys Ala Ile Asn Tyr Gly Val 180185 190 Lys Val Arg Phe Leu Glu Arg Asp Glu Phe Asn Leu Phe Leu Asp Leu195 200 205 Tyr Arg Glu Thr Glu Glu Arg Ala Gly Phe Val Ser Lys Thr AspAsp 210 215 220 Tyr Phe Tyr Asn Phe Ile Asp Thr Tyr Gly Asp Lys Val LeuVal Pro 225 230 235 240 Leu Ala Tyr Ile Asp Leu Asp Glu Tyr Val Leu LysLeu Gln Gln Glu 245 250 255 Leu Asn Asp Lys Glu Asn Arg Arg Asp Gln MetMet Ala Lys Glu Asn 260 265 270 Lys Ser Asp Lys Gln Met Lys Lys Ile AlaGlu Leu Asp Lys Gln Ile 275 280 285 Asp His Asp Gln His Glu Leu Leu AsnAla Ser Glu Leu Ser Lys Thr 290 295 300 Asp Gly Pro Ile Leu Asn Leu AlaSer Gly Val Tyr Phe Ala Asn Ala 305 310 315 320 Tyr Glu Val Asn Tyr PheSer Gly Gly Ser Ser Glu Lys Tyr Asn Gln 325 330 335 Phe Met Gly Pro TyrMet Met His Trp Phe Met Ile Asn Tyr Cys Phe 340 345 350 Asp Asn Gly TyrAsp Arg Tyr Asn Phe Tyr Gly Leu Ser Gly Asp Phe 355 360 365 Thr Glu AsnSer Glu Asp Tyr Gly Val Tyr Arg Phe Lys Arg Gly Phe 370 375 380 Asn ValGln Ile Glu Glu Leu Ile Gly Asp Phe Tyr Lys Pro Ile His 385 390 395 400Lys Val Lys Tyr Trp Leu Phe Thr Thr Leu Asp Lys Leu Arg Lys Lys 405 410415 Leu Lys Lys 431 base pairs nucleic acid single linear other nucleicacid /desc = “DNA (genomic) (p2c90)” 3 GACGTGCTGA TGCAAATATA AGTNCATGACCATCATGTTG TAATTGTNAC AATGTATCAA 60 TAATAGTCTG GTCAATTAAT CGGCCGTCAAAACAAAGCGT ACCATCAATA TCANAGACAA 120 ATCTCATCAN ATCACTCCAA ACAATATAATACCATGATTA TAGCATAAGT NAGTCATGTN 180 ACGATATTAG ATAATGATTA TNATGTAGGGTACCTTTTGC CTTACACACA TATNACTTCC 240 TATNATATAT ATTATGTCAA CNNGAATGTNAAATTCCATA AGGGGACTTA TATAACTGTN 300 TGTCTGTNTA GTGTTTATGT CAGTCAGCTAAATTNACATT CATGTTATGT CTCATTAAAC 360 CAATTACTCA CGTNTTGGTG CATATCNCATCTTTCATATC GTCATACATC TATCCTCATT 420 CTCNTGNCTG A 431 729 base pairsnucleic acid single linear other nucleic acid /desc = “DNA (genomic)(p9b74)” CDS 1..726 4 ATG ACT AAA TTA TTT ATA CCT TAT ATT ATG GGC AATAAA GAT TTG ATT 48 Met Thr Lys Leu Phe Ile Pro Tyr Ile Met Gly Asn LysAsp Leu Ile 1 5 10 15 GAA AAT GCA ACA TTG TTG AGT GAA AAT GGT GCA GATATA ATT GAA ATT 96 Glu Asn Ala Thr Leu Leu Ser Glu Asn Gly Ala Asp IleIle Glu Ile 20 25 30 GGA GTA CCT TTC TCT GAT CCG GTT GCT GAT GGT CCA GTTATC ATG GAA 144 Gly Val Pro Phe Ser Asp Pro Val Ala Asp Gly Pro Val IleMet Glu 35 40 45 GCA GGT CAA CAA GCG ATT AAA CAA GGC ATC ACG ATA GAT TATATT TTC 192 Ala Gly Gln Gln Ala Ile Lys Gln Gly Ile Thr Ile Asp Tyr IlePhe 50 55 60 AAT CAA TTA GAA AAA CAT GGT GAT CAA ATT AAG TGT AAC TAT GTATTA 240 Asn Gln Leu Glu Lys His Gly Asp Gln Ile Lys Cys Asn Tyr Val Leu65 70 75 80 ATG ACG TAT TAT AAT ATT ATT TGT CAT TAT GGA GAA CAA GCG TTTTTT 288 Met Thr Tyr Tyr Asn Ile Ile Cys His Tyr Gly Glu Gln Ala Phe Phe85 90 95 GAA AAA TGT CGA GAT ACT GGT GTC TAC GGC TTA ATT ATT CCT GAT TTA336 Glu Lys Cys Arg Asp Thr Gly Val Tyr Gly Leu Ile Ile Pro Asp Leu 100105 110 CCA TAT GAA TTA TCG CAG CGT TTA AAA CAA CAA TTT AGT CAC TAT GGC384 Pro Tyr Glu Leu Ser Gln Arg Leu Lys Gln Gln Phe Ser His Tyr Gly 115120 125 GTC AAA ATC ATA TCG TTA GTT GCG ATG ACT ACT GAT GAC AAA CGT ATA432 Val Lys Ile Ile Ser Leu Val Ala Met Thr Thr Asp Asp Lys Arg Ile 130135 140 AAA GAT ATC GTA TCC CAT GCG GAA GGC TTT ATT TAT ACT GTG ACG ATG480 Lys Asp Ile Val Ser His Ala Glu Gly Phe Ile Tyr Thr Val Thr Met 145150 155 160 AAT GCG ACA ACA GGG CAA AAC GGT GCG TTT CAT CCA GAA TTA AAACGA 528 Asn Ala Thr Thr Gly Gln Asn Gly Ala Phe His Pro Glu Leu Lys Arg165 170 175 AAA ATT GAG TCA ATT AAA GCG ATA GCC AAT GTG CCA GTT GTC GCAGGA 576 Lys Ile Glu Ser Ile Lys Ala Ile Ala Asn Val Pro Val Val Ala Gly180 185 190 TTT GGT ATA AGA ACA CCA CAA CAT GTT GCA GAT ATA AAA GAG GTTGCA 624 Phe Gly Ile Arg Thr Pro Gln His Val Ala Asp Ile Lys Glu Val Ala195 200 205 GAT GGC ATT GTC ATT GGT AGC GAA ATC GTT AAG CGA TTT AAA TCTAAC 672 Asp Gly Ile Val Ile Gly Ser Glu Ile Val Lys Arg Phe Lys Ser Asn210 215 220 ACG CGT GAG GAA ATC ATT AAA TAT TTA CAA TCT ATC CAA CAA ACATTG 720 Thr Arg Glu Glu Ile Ile Lys Tyr Leu Gln Ser Ile Gln Gln Thr Leu225 230 235 240 AAT AAT TAA 729 Asn Asn 242 amino acids amino acidlinear protein 5 Met Thr Lys Leu Phe Ile Pro Tyr Ile Met Gly Asn Lys AspLeu Ile 1 5 10 15 Glu Asn Ala Thr Leu Leu Ser Glu Asn Gly Ala Asp IleIle Glu Ile 20 25 30 Gly Val Pro Phe Ser Asp Pro Val Ala Asp Gly Pro ValIle Met Glu 35 40 45 Ala Gly Gln Gln Ala Ile Lys Gln Gly Ile Thr Ile AspTyr Ile Phe 50 55 60 Asn Gln Leu Glu Lys His Gly Asp Gln Ile Lys Cys AsnTyr Val Leu 65 70 75 80 Met Thr Tyr Tyr Asn Ile Ile Cys His Tyr Gly GluGln Ala Phe Phe 85 90 95 Glu Lys Cys Arg Asp Thr Gly Val Tyr Gly Leu IleIle Pro Asp Leu 100 105 110 Pro Tyr Glu Leu Ser Gln Arg Leu Lys Gln GlnPhe Ser His Tyr Gly 115 120 125 Val Lys Ile Ile Ser Leu Val Ala Met ThrThr Asp Asp Lys Arg Ile 130 135 140 Lys Asp Ile Val Ser His Ala Glu GlyPhe Ile Tyr Thr Val Thr Met 145 150 155 160 Asn Ala Thr Thr Gly Gln AsnGly Ala Phe His Pro Glu Leu Lys Arg 165 170 175 Lys Ile Glu Ser Ile LysAla Ile Ala Asn Val Pro Val Val Ala Gly 180 185 190 Phe Gly Ile Arg ThrPro Gln His Val Ala Asp Ile Lys Glu Val Ala 195 200 205 Asp Gly Ile ValIle Gly Ser Glu Ile Val Lys Arg Phe Lys Ser Asn 210 215 220 Thr Arg GluGlu Ile Ile Lys Tyr Leu Gln Ser Ile Gln Gln Thr Leu 225 230 235 240 AsnAsn 1260 base pairs nucleic acid single linear other nucleic acid /desc= “DNA (genomic) (p11c29/p13c83)” CDS 1..1257 6 ATG AAA TTT ACA GAG TTAACT GTT ACC GAA TTT GAC AAC TTT GTA CAA 48 Met Lys Phe Thr Glu Leu ThrVal Thr Glu Phe Asp Asn Phe Val Gln 1 5 10 15 AAT CCA TCA TTG GAA AGTCAT TAT TTC CAA GTA AAA GAA AAT ATA GTT 96 Asn Pro Ser Leu Glu Ser HisTyr Phe Gln Val Lys Glu Asn Ile Val 20 25 30 ACC CGT GAG AAT GAT GGC TTTGAA GTA GTT TTA TTA GGT ATT AAA GAC 144 Thr Arg Glu Asn Asp Gly Phe GluVal Val Leu Leu Gly Ile Lys Asp 35 40 45 GAC AAT AAC AAA GTA ATT GCA GCAAGC CTT TTC TCT AAA ATT CCT ACT 192 Asp Asn Asn Lys Val Ile Ala Ala SerLeu Phe Ser Lys Ile Pro Thr 50 55 60 ATG GGA AGT TAT GTT TAC TAT TCG AATCGT GGT CCA GTA ATG GAT TTT 240 Met Gly Ser Tyr Val Tyr Tyr Ser Asn ArgGly Pro Val Met Asp Phe 65 70 75 80 TCA GAT TTA GGA TTA GTT GAT TAT TATTTA AAA GAG TTA GAT AAA TAT 288 Ser Asp Leu Gly Leu Val Asp Tyr Tyr LeuLys Glu Leu Asp Lys Tyr 85 90 95 TTA CAG CAA CAT CAA TGT TTA TAT GTT AAATTA GAT CCG TAT TGG TTA 336 Leu Gln Gln His Gln Cys Leu Tyr Val Lys LeuAsp Pro Tyr Trp Leu 100 105 110 TAT CAT CTA TAT GAT AAA GAT ATC GTG CCATTT GAA GGT CGC GAG AAA 384 Tyr His Leu Tyr Asp Lys Asp Ile Val Pro PheGlu Gly Arg Glu Lys 115 120 125 AAT GAT GCC CTA GTA AAC TTG TTT AAA TCACAT GGT TAC GAG CAT CAT 432 Asn Asp Ala Leu Val Asn Leu Phe Lys Ser HisGly Tyr Glu His His 130 135 140 GGC TTT ACA ACT GAG TAT GAT ACA TCG AGCCAA GTA CGA TGG ATG GGC 480 Gly Phe Thr Thr Glu Tyr Asp Thr Ser Ser GlnVal Arg Trp Met Gly 145 150 155 160 GTA TTA AAC CTT GAA GGT AAA ACA CCCGAA ACA TTG AAA AAG ACA TTT 528 Val Leu Asn Leu Glu Gly Lys Thr Pro GluThr Leu Lys Lys Thr Phe 165 170 175 GAT AGT CAA CGT AAA CGT AAT ATT AATAAA GCG ATA AAC TAT GGT GTT 576 Asp Ser Gln Arg Lys Arg Asn Ile Asn LysAla Ile Asn Tyr Gly Val 180 185 190 AAA GTC AGA TTC CTT GAA CGT GAT GAGTTC AAT CTT TTC TTA GAT TTA 624 Lys Val Arg Phe Leu Glu Arg Asp Glu PheAsn Leu Phe Leu Asp Leu 195 200 205 TAT CGT GAA ACT GAA GAG CGT GCT GGATTT GTA TCA AAA ACA GAT GAT 672 Tyr Arg Glu Thr Glu Glu Arg Ala Gly PheVal Ser Lys Thr Asp Asp 210 215 220 TAT TTT TAT AAC TTT ATT GAC ACA TATGGA GAT AAA GTA TTA GTA CCA 720 Tyr Phe Tyr Asn Phe Ile Asp Thr Tyr GlyAsp Lys Val Leu Val Pro 225 230 235 240 TTA GCA TAT ATT GAC CTT GAT GAATAT GTG TTA AAG TTG CAA CAG GAA 768 Leu Ala Tyr Ile Asp Leu Asp Glu TyrVal Leu Lys Leu Gln Gln Glu 245 250 255 TTG AAT GAC AAA GAA AAT CGT CGTGAT CAA ATG ATG GCG AAA GAA AAC 816 Leu Asn Asp Lys Glu Asn Arg Arg AspGln Met Met Ala Lys Glu Asn 260 265 270 AAA TCA GAT AAG CAA ATG AAG AAAATT GCA GAA TTA GAT AAG CAA ATT 864 Lys Ser Asp Lys Gln Met Lys Lys IleAla Glu Leu Asp Lys Gln Ile 275 280 285 GAT CAT GAT CAG CAT GAA TTA TTGAAT GCA AGT GAA TTG AGC AAA ACG 912 Asp His Asp Gln His Glu Leu Leu AsnAla Ser Glu Leu Ser Lys Thr 290 295 300 GAC GGC CCA ATT CTA AAC CTT GCTTCT GGC GTT TAT TTT GCA AAT GCA 960 Asp Gly Pro Ile Leu Asn Leu Ala SerGly Val Tyr Phe Ala Asn Ala 305 310 315 320 TAT GAA GTG AAT TAT TTC TCTGGT GGT TCA TCA GAA AAA TAT AAT CAA 1008 Tyr Glu Val Asn Tyr Phe Ser GlyGly Ser Ser Glu Lys Tyr Asn Gln 325 330 335 TTT ATG GGA CCA TAC ATG ATGCAT TGG TTT ATG ATT AAC TAT TGC TTC 1056 Phe Met Gly Pro Tyr Met Met HisTrp Phe Met Ile Asn Tyr Cys Phe 340 345 350 GAT AAT GGC TAT GAT CGT TATAAT TTC TAT GGT TTA TCA GGT GAT TTT 1104 Asp Asn Gly Tyr Asp Arg Tyr AsnPhe Tyr Gly Leu Ser Gly Asp Phe 355 360 365 ACG GAA AAC AGT GAA GAT TATGGC GTA TAC CGC TTT AAA CGT GGA TTT 1152 Thr Glu Asn Ser Glu Asp Tyr GlyVal Tyr Arg Phe Lys Arg Gly Phe 370 375 380 AAT GTA CAA ATC GAA GAA TTAATA GGG GAT TTC TAT AAA CCA ATT CAT 1200 Asn Val Gln Ile Glu Glu Leu IleGly Asp Phe Tyr Lys Pro Ile His 385 390 395 400 AAA GTG AAA TAT TGG TTGTTC ACA ACA TTG GAT AAA TTA CGT AAA AAA 1248 Lys Val Lys Tyr Trp Leu PheThr Thr Leu Asp Lys Leu Arg Lys Lys 405 410 415 TTA AAG AAA TAG 1260 LeuLys Lys 419 amino acids amino acid linear protein 7 Met Lys Phe Thr GluLeu Thr Val Thr Glu Phe Asp Asn Phe Val Gln 1 5 10 15 Asn Pro Ser LeuGlu Ser His Tyr Phe Gln Val Lys Glu Asn Ile Val 20 25 30 Thr Arg Glu AsnAsp Gly Phe Glu Val Val Leu Leu Gly Ile Lys Asp 35 40 45 Asp Asn Asn LysVal Ile Ala Ala Ser Leu Phe Ser Lys Ile Pro Thr 50 55 60 Met Gly Ser TyrVal Tyr Tyr Ser Asn Arg Gly Pro Val Met Asp Phe 65 70 75 80 Ser Asp LeuGly Leu Val Asp Tyr Tyr Leu Lys Glu Leu Asp Lys Tyr 85 90 95 Leu Gln GlnHis Gln Cys Leu Tyr Val Lys Leu Asp Pro Tyr Trp Leu 100 105 110 Tyr HisLeu Tyr Asp Lys Asp Ile Val Pro Phe Glu Gly Arg Glu Lys 115 120 125 AsnAsp Ala Leu Val Asn Leu Phe Lys Ser His Gly Tyr Glu His His 130 135 140Gly Phe Thr Thr Glu Tyr Asp Thr Ser Ser Gln Val Arg Trp Met Gly 145 150155 160 Val Leu Asn Leu Glu Gly Lys Thr Pro Glu Thr Leu Lys Lys Thr Phe165 170 175 Asp Ser Gln Arg Lys Arg Asn Ile Asn Lys Ala Ile Asn Tyr GlyVal 180 185 190 Lys Val Arg Phe Leu Glu Arg Asp Glu Phe Asn Leu Phe LeuAsp Leu 195 200 205 Tyr Arg Glu Thr Glu Glu Arg Ala Gly Phe Val Ser LysThr Asp Asp 210 215 220 Tyr Phe Tyr Asn Phe Ile Asp Thr Tyr Gly Asp LysVal Leu Val Pro 225 230 235 240 Leu Ala Tyr Ile Asp Leu Asp Glu Tyr ValLeu Lys Leu Gln Gln Glu 245 250 255 Leu Asn Asp Lys Glu Asn Arg Arg AspGln Met Met Ala Lys Glu Asn 260 265 270 Lys Ser Asp Lys Gln Met Lys LysIle Ala Glu Leu Asp Lys Gln Ile 275 280 285 Asp His Asp Gln His Glu LeuLeu Asn Ala Ser Glu Leu Ser Lys Thr 290 295 300 Asp Gly Pro Ile Leu AsnLeu Ala Ser Gly Val Tyr Phe Ala Asn Ala 305 310 315 320 Tyr Glu Val AsnTyr Phe Ser Gly Gly Ser Ser Glu Lys Tyr Asn Gln 325 330 335 Phe Met GlyPro Tyr Met Met His Trp Phe Met Ile Asn Tyr Cys Phe 340 345 350 Asp AsnGly Tyr Asp Arg Tyr Asn Phe Tyr Gly Leu Ser Gly Asp Phe 355 360 365 ThrGlu Asn Ser Glu Asp Tyr Gly Val Tyr Arg Phe Lys Arg Gly Phe 370 375 380Asn Val Gln Ile Glu Glu Leu Ile Gly Asp Phe Tyr Lys Pro Ile His 385 390395 400 Lys Val Lys Tyr Trp Leu Phe Thr Thr Leu Asp Lys Leu Arg Lys Lys405 410 415 Leu Lys Lys 731 base pairs nucleic acid single linear othernucleic acid /desc = “DNA (genomic) CDS 1..417 CDS 524..730 8 ATG GAATTC ACT TAT TCG TAT TTA TTT AGA ATG ATT AGT CAT GAG ATG 48 Met Glu PheThr Tyr Ser Tyr Leu Phe Arg Met Ile Ser His Glu Met 1 5 10 15 AAA CAAAAG GCT GAT CAA AAG TTA GAG CAA TTT GAT ATT ACA AAT GAG 96 Lys Gln LysAla Asp Gln Lys Leu Glu Gln Phe Asp Ile Thr Asn Glu 20 25 30 CAA GGT CATACG TTA GGT TAT CTT TAT GCA CAT CAA CAA GAT GGA CTG 144 Gln Gly His ThrLeu Gly Tyr Leu Tyr Ala His Gln Gln Asp Gly Leu 35 40 45 ACA CAA AAT GATATT GCT AAA GCA TTA CAA CGA ACA GGT CCA ACT GTC 192 Thr Gln Asn Asp IleAla Lys Ala Leu Gln Arg Thr Gly Pro Thr Val 50 55 60 AGT AAT TTA TTA AGGAAC CTT GAA CGT AAA AAG CTG ATC TAT CGC TAT 240 Ser Asn Leu Leu Arg AsnLeu Glu Arg Lys Lys Leu Ile Tyr Arg Tyr 65 70 75 80 GTC GAT GCA CAA GATACG AGA AGA AAG AAT ATA GGG CTG ACT ACC TCT 288 Val Asp Ala Gln Asp ThrArg Arg Lys Asn Ile Gly Leu Thr Thr Ser 85 90 95 GGG ATT AAA CTC GTA GAAGCA TTC ACT TCG ATA TTT GAT GAA ATG GAA 336 Gly Ile Lys Leu Val Glu AlaPhe Thr Ser Ile Phe Asp Glu Met Glu 100 105 110 CAA ACA CTC GTA TCG CAGTTA TCT GAA GAA GAA AAT GAA CAA ATG AAA 384 Gln Thr Leu Val Ser Gln LeuSer Glu Glu Glu Asn Glu Gln Met Lys 115 120 125 GCA AAC TTA ACT AAA ATGTTA TCT AGT TTA CAA TAAATGATAA GTGTGACTG 437 Ala Asn Leu Thr Lys Met LeuSer Ser Leu Gln 130 135 TAGAAATCAG TCACTTTGTC TTTAATATTA TAGTTAGATATCTAATTGTT AGTAAGCTA 497 TTATTGGAAA AGACAAGGAG TATTGA ACA ATG AAA GACGAA CAA TTA TAT TAT 550 Thr Met Lys Asp Glu Gln Leu Tyr Tyr 1 5 TTT GAGAAA TCG CCA GTA TTT AAA GCG ATG ATG CAT TTC TCA TTG CCA 598 Phe Glu LysSer Pro Val Phe Lys Ala Met Met His Phe Ser Leu Pro 10 15 20 25 ATG ATGATA GGG ACT TTA TTA AGC GTT ATT TAT GGC ATA TTA AAT ATT 646 Met Met IleGly Thr Leu Leu Ser Val Ile Tyr Gly Ile Leu Asn Ile 30 35 40 TAC TTT ATAGGA TTT TYA GAM GAY AGC CAC ATG ATT TCT GCT AAT CTC 694 Tyr Phe Ile GlyPhe Xaa Xaa Asp Ser His Met Ile Ser Ala Asn Leu 45 50 55 TCT AAC ACT GCCAGT ATT TGC TAT CTT AAT GGG GTT A 731 Ser Asn Thr Ala Ser Ile Cys TyrLeu Asn Gly Val 60 65 139 amino acids amino acid linear protein 9 MetGlu Phe Thr Tyr Ser Tyr Leu Phe Arg Met Ile Ser His Glu Met 1 5 10 15Lys Gln Lys Ala Asp Gln Lys Leu Glu Gln Phe Asp Ile Thr Asn Glu 20 25 30Gln Gly His Thr Leu Gly Tyr Leu Tyr Ala His Gln Gln Asp Gly Leu 35 40 45Thr Gln Asn Asp Ile Ala Lys Ala Leu Gln Arg Thr Gly Pro Thr Val 50 55 60Ser Asn Leu Leu Arg Asn Leu Glu Arg Lys Lys Leu Ile Tyr Arg Tyr 65 70 7580 Val Asp Ala Gln Asp Thr Arg Arg Lys Asn Ile Gly Leu Thr Thr Ser 85 9095 Gly Ile Lys Leu Val Glu Ala Phe Thr Ser Ile Phe Asp Glu Met Glu 100105 110 Gln Thr Leu Val Ser Gln Leu Ser Glu Glu Glu Asn Glu Gln Met Lys115 120 125 Ala Asn Leu Thr Lys Met Leu Ser Ser Leu Gln 130 135 69 aminoacids amino acid linear protein 10 Thr Met Lys Asp Glu Gln Leu Tyr TyrPhe Glu Lys Ser Pro Val Phe 1 5 10 15 Lys Ala Met Met His Phe Ser LeuPro Met Met Ile Gly Thr Leu Leu 20 25 30 Ser Val Ile Tyr Gly Ile Leu AsnIle Tyr Phe Ile Gly Phe Xaa Xaa 35 40 45 Asp Ser His Met Ile Ser Ala AsnLeu Ser Asn Thr Ala Ser Ile Cys 50 55 60 Tyr Leu Asn Gly Val 65 828 basepairs nucleic acid single linear other nucleic acid /desc = ”DNA(genomic) (p5c4)“ CDS 1..825 11 GTG AGG TAC TTG CCT GTG AAT GCG ATT GAAATC CCG ACT ACC GCC GGC 48 Val Arg Tyr Leu Pro Val Asn Ala Ile Glu IlePro Thr Thr Ala Gly 1 5 10 15 ACG CCT GAC GCG CCC TTC TAC CAA CCG TTGGGC AAT GAA GAG CAG CTG 96 Thr Pro Asp Ala Pro Phe Tyr Gln Pro Leu GlyAsn Glu Glu Gln Leu 20 25 30 TTC CAG CAG GCC TGG CAG CAC GGC ATG CCC GTGCTT ATC AAG GGC CCG 144 Phe Gln Gln Ala Trp Gln His Gly Met Pro Val LeuIle Lys Gly Pro 35 40 45 ACC GGC TGC GGC AAG ACC CGT TTC GTA CAG CAC ATGGCG CAT CGC CTG 192 Thr Gly Cys Gly Lys Thr Arg Phe Val Gln His Met AlaHis Arg Leu 50 55 60 AAT CTG CCG CTG TAC ACC GTG GCC TGC CAT GAC GAC CTGTCG GCG GCC 240 Asn Leu Pro Leu Tyr Thr Val Ala Cys His Asp Asp Leu SerAla Ala 65 70 75 80 GAC CTG GTC GGC CGA CAC CTG ATC GGC GCA CAG GGC ACCTGG TGG CAG 288 Asp Leu Val Gly Arg His Leu Ile Gly Ala Gln Gly Thr TrpTrp Gln 85 90 95 GAC GGT CCG CTG ACC CGC GCG GTC CGC GAA GGA GGC ATC TGCTAC CTG 336 Asp Gly Pro Leu Thr Arg Ala Val Arg Glu Gly Gly Ile Cys TyrLeu 100 105 110 GAC GAA GTG GTG GAA GCA CGG CAG GAC ACC GCC GTG GTA CTGCAC CCG 384 Asp Glu Val Val Glu Ala Arg Gln Asp Thr Ala Val Val Leu HisPro 115 120 125 CTG GCC GAT GAT CGC CGC GAA CTG TTC ATC GAG CGC ACC GGCGAG GCG 432 Leu Ala Asp Asp Arg Arg Glu Leu Phe Ile Glu Arg Thr Gly GluAla 130 135 140 CTC AAG GCG CCG CCG GGC TTC ATG CTG GTG GTG TCC TAC AACCCC GGT 480 Leu Lys Ala Pro Pro Gly Phe Met Leu Val Val Ser Tyr Asn ProGly 145 150 155 160 TAC CAA AAC CTG CTC AAG GGC ATG AAG CCC AGC ACC CGCCAG CGC TTC 528 Tyr Gln Asn Leu Leu Lys Gly Met Lys Pro Ser Thr Arg GlnArg Phe 165 170 175 GTG GCG ATG CGC TTC GAC TAT CCG CCG ACC GCC GAG GAAGAG CGC ATC 576 Val Ala Met Arg Phe Asp Tyr Pro Pro Thr Ala Glu Glu GluArg Ile 180 185 190 GTC GCC AAC GAG GCG CAG GTC GAT GCC GCG CTC GCC GCCCAG GTG GTC 624 Val Ala Asn Glu Ala Gln Val Asp Ala Ala Leu Ala Ala GlnVal Val 195 200 205 AAG CTT GGC CAG GCA CTG CGT CGG CTG GAA CAG CAC GATCTG GAG GAA 672 Lys Leu Gly Gln Ala Leu Arg Arg Leu Glu Gln His Asp LeuGlu Glu 210 215 220 GTC GCC TCG ACC CGC CTG CTG ATC TTC ACC GCA CGC ATGATC CGC TCC 720 Val Ala Ser Thr Arg Leu Leu Ile Phe Thr Ala Arg Met IleArg Ser 225 230 235 240 GGC ATG ACG CCG CGG CAG GCC TGC CTG GCC TGC CTCGCC GAA CCG CTG 768 Gly Met Thr Pro Arg Gln Ala Cys Leu Ala Cys Leu AlaGlu Pro Leu 245 250 255 TCG GAT GAT CCG CAG ACC GTT GCC GCG CTG ATG GATGTG GTC TAT GTC 816 Ser Asp Asp Pro Gln Thr Val Ala Ala Leu Met Asp ValVal Tyr Val 260 265 270 CAC TTC GGC TGA 828 His Phe Gly 275 275 aminoacids amino acid linear protein 12 Val Arg Tyr Leu Pro Val Asn Ala IleGlu Ile Pro Thr Thr Ala Gly 1 5 10 15 Thr Pro Asp Ala Pro Phe Tyr GlnPro Leu Gly Asn Glu Glu Gln Leu 20 25 30 Phe Gln Gln Ala Trp Gln His GlyMet Pro Val Leu Ile Lys Gly Pro 35 40 45 Thr Gly Cys Gly Lys Thr Arg PheVal Gln His Met Ala His Arg Leu 50 55 60 Asn Leu Pro Leu Tyr Thr Val AlaCys His Asp Asp Leu Ser Ala Ala 65 70 75 80 Asp Leu Val Gly Arg His LeuIle Gly Ala Gln Gly Thr Trp Trp Gln 85 90 95 Asp Gly Pro Leu Thr Arg AlaVal Arg Glu Gly Gly Ile Cys Tyr Leu 100 105 110 Asp Glu Val Val Glu AlaArg Gln Asp Thr Ala Val Val Leu His Pro 115 120 125 Leu Ala Asp Asp ArgArg Glu Leu Phe Ile Glu Arg Thr Gly Glu Ala 130 135 140 Leu Lys Ala ProPro Gly Phe Met Leu Val Val Ser Tyr Asn Pro Gly 145 150 155 160 Tyr GlnAsn Leu Leu Lys Gly Met Lys Pro Ser Thr Arg Gln Arg Phe 165 170 175 ValAla Met Arg Phe Asp Tyr Pro Pro Thr Ala Glu Glu Glu Arg Ile 180 185 190Val Ala Asn Glu Ala Gln Val Asp Ala Ala Leu Ala Ala Gln Val Val 195 200205 Lys Leu Gly Gln Ala Leu Arg Arg Leu Glu Gln His Asp Leu Glu Glu 210215 220 Val Ala Ser Thr Arg Leu Leu Ile Phe Thr Ala Arg Met Ile Arg Ser225 230 235 240 Gly Met Thr Pro Arg Gln Ala Cys Leu Ala Cys Leu Ala GluPro Leu 245 250 255 Ser Asp Asp Pro Gln Thr Val Ala Ala Leu Met Asp ValVal Tyr Val 260 265 270 His Phe Gly 275 528 base pairs nucleic acidsingle linear other nucleic acid /desc = ”DNA (genomic) (p9b66)“ CDS1..525 13 ATG GAA AGG ATG TCA AAA ATG AAT ATT AAT ACA GCT TAT TTT GCCGGA 48 Met Glu Arg Met Ser Lys Met Asn Ile Asn Thr Ala Tyr Phe Ala Gly 15 10 15 GGT TGC TTT TGG TGT ATG ACG AAA CCA TTT GAC ACC TTT GAC GGC ATA96 Gly Cys Phe Trp Cys Met Thr Lys Pro Phe Asp Thr Phe Asp Gly Ile 20 2530 GAA AAA GTA ACT TCT GGA TAT ATG GGC GGA CAT ATT GAA AAT CCT ACT 144Glu Lys Val Thr Ser Gly Tyr Met Gly Gly His Ile Glu Asn Pro Thr 35 40 45TAC GAA CAA GTA AAA TCA GGT ACG AGT GGT CAT TTA GAA ACT GTT GAA 192 TyrGlu Gln Val Lys Ser Gly Thr Ser Gly His Leu Glu Thr Val Glu 50 55 60 ATTCAA TAT GAT GTT GCA TTA TTC TCA TAC AAT AAG TTA TTA GAA ATA 240 Ile GlnTyr Asp Val Ala Leu Phe Ser Tyr Asn Lys Leu Leu Glu Ile 65 70 75 80 TTTTTC TCA GTC ATT GAC CCA TTA GAT ACA GGT GGT CAA TAT CAA GAC 288 Phe PheSer Val Ile Asp Pro Leu Asp Thr Gly Gly Gln Tyr Gln Asp 85 90 95 CGT GGTCCT CAA TAT NAA ACA GCT ATT TTC TAC ACT AAT GAT CAT CAA 336 Arg Gly ProGln Tyr Xaa Thr Ala Ile Phe Tyr Thr Asn Asp His Gln 100 105 110 AAA GAACTC GCT GAG ACT TAT ATC GAG CAG CTT AAA AAT ACG ATT AAT 384 Lys Glu LeuAla Glu Thr Tyr Ile Glu Gln Leu Lys Asn Thr Ile Asn 115 120 125 GCT GATAAG GCA ATT GCA ACA AAA ATA YTA CCA GCG TCA CAA TTT TAC 432 Ala Asp LysAla Ile Ala Thr Lys Ile Leu Pro Ala Ser Gln Phe Tyr 130 135 140 AAA GCCGAA GAC TAT CAC CAA GAT TTT TAT AAG AAA AAT CCA GAG CGC 480 Lys Ala GluAsp Tyr His Gln Asp Phe Tyr Lys Lys Asn Pro Glu Arg 145 150 155 160 TATGCA GAA GAA CAA AAA ATA CGC CAA GAA TAC AAA AAT AAG CAA 525 Tyr Ala GluGlu Gln Lys Ile Arg Gln Glu Tyr Lys Asn Lys Gln 165 170 175 TAA 528 175amino acids amino acid linear protein 14 Met Glu Arg Met Ser Lys Met AsnIle Asn Thr Ala Tyr Phe Ala Gly 1 5 10 15 Gly Cys Phe Trp Cys Met ThrLys Pro Phe Asp Thr Phe Asp Gly Ile 20 25 30 Glu Lys Val Thr Ser Gly TyrMet Gly Gly His Ile Glu Asn Pro Thr 35 40 45 Tyr Glu Gln Val Lys Ser GlyThr Ser Gly His Leu Glu Thr Val Glu 50 55 60 Ile Gln Tyr Asp Val Ala LeuPhe Ser Tyr Asn Lys Leu Leu Glu Ile 65 70 75 80 Phe Phe Ser Val Ile AspPro Leu Asp Thr Gly Gly Gln Tyr Gln Asp 85 90 95 Arg Gly Pro Gln Tyr XaaThr Ala Ile Phe Tyr Thr Asn Asp His Gln 100 105 110 Lys Glu Leu Ala GluThr Tyr Ile Glu Gln Leu Lys Asn Thr Ile Asn 115 120 125 Ala Asp Lys AlaIle Ala Thr Lys Ile Leu Pro Ala Ser Gln Phe Tyr 130 135 140 Lys Ala GluAsp Tyr His Gln Asp Phe Tyr Lys Lys Asn Pro Glu Arg 145 150 155 160 TyrAla Glu Glu Gln Lys Ile Arg Gln Glu Tyr Lys Asn Lys Gln 165 170 175 702base pairs nucleic acid single linear other nucleic acid /desc = ”DNA(genomic) (p7c26)“ CDS 1..699 15 ATG ATT GAG TTA AAA CAT GTG ACT TTT GGTTAT AAT AAA AAG CAG ATG 48 Met Ile Glu Leu Lys His Val Thr Phe Gly TyrAsn Lys Lys Gln Met 1 5 10 15 GTG CTA CAA GAT ATC AAT ATT ACT ATA CCTGAT GGA GAA AAT GTT GGT 96 Val Leu Gln Asp Ile Asn Ile Thr Ile Pro AspGly Glu Asn Val Gly 20 25 30 ATT TTA GGC GAA AGT GGC TGT GGT AAA AGT ACGCTC GCT TCA TTG GTT 144 Ile Leu Gly Glu Ser Gly Cys Gly Lys Ser Thr LeuAla Ser Leu Val 35 40 45 CTT GGC TTG TTT AAA CCT GTT AAA GGA GAG ATT TACTTA AGT GAC AAT 192 Leu Gly Leu Phe Lys Pro Val Lys Gly Glu Ile Tyr LeuSer Asp Asn 50 55 60 GCT GTG TTA CCG ATT TTC CAA CAC CCT TTA ACT AGC TTTAAC CCT GAT 240 Ala Val Leu Pro Ile Phe Gln His Pro Leu Thr Ser Phe AsnPro Asp 65 70 75 80 TGG ACG ATT GAG ACC TCA TTA AAA GAA GCG TTA TAT TATTAC AGA GGT 288 Trp Thr Ile Glu Thr Ser Leu Lys Glu Ala Leu Tyr Tyr TyrArg Gly 85 90 95 CTA ACT GAT AAT ACT GCT CAG GAT CAA TTA TTA TTA CAA CATTTA TCT 336 Leu Thr Asp Asn Thr Ala Gln Asp Gln Leu Leu Leu Gln His LeuSer 100 105 110 ACT TTT GAG TTA AAC GCG CAA TTA TTG ACT AAA TTA CCA AGCGAA GTG 384 Thr Phe Glu Leu Asn Ala Gln Leu Leu Thr Lys Leu Pro Ser GluVal 115 120 125 AGT GGC GGA CAA TTA CAA AGA TTT AAT GTC ATG CGT TCG TTATTA GCA 432 Ser Gly Gly Gln Leu Gln Arg Phe Asn Val Met Arg Ser Leu LeuAla 130 135 140 CAG CCT CGC GTT TTA ATA TGT GAT GAG ATA ACT TCA AAT TTAGAT GTT 480 Gln Pro Arg Val Leu Ile Cys Asp Glu Ile Thr Ser Asn Leu AspVal 145 150 155 160 ATA GCT GAA CAA AAT GTA ATC AAT ATA TTA AAA GCG CAAACG ATT ACG 528 Ile Ala Glu Gln Asn Val Ile Asn Ile Leu Lys Ala Gln ThrIle Thr 165 170 175 AAC TTA AAT CAT TTT ATC GTT ATT TCT CAT GAT TTA TCCGTG TTA CAA 576 Asn Leu Asn His Phe Ile Val Ile Ser His Asp Leu Ser ValLeu Gln 180 185 190 CGC TTA GTT AAT AGA ATT ATC GTT CTT AAG GAT GGC ATGATA GTC GAT 624 Arg Leu Val Asn Arg Ile Ile Val Leu Lys Asp Gly Met IleVal Asp 195 200 205 GAT TTT GCA ATA GAG GAA TTA TTT AAT GTT GAT AGA CACCCT TAT ACA 672 Asp Phe Ala Ile Glu Glu Leu Phe Asn Val Asp Arg His ProTyr Thr 210 215 220 AAA GAA TTA GTG CAA GCA TTT TCA TAT TAG 702 Lys GluLeu Val Gln Ala Phe Ser Tyr 225 230 233 amino acids amino acid linearprotein 16 Met Ile Glu Leu Lys His Val Thr Phe Gly Tyr Asn Lys Lys GlnMet 1 5 10 15 Val Leu Gln Asp Ile Asn Ile Thr Ile Pro Asp Gly Glu AsnVal Gly 20 25 30 Ile Leu Gly Glu Ser Gly Cys Gly Lys Ser Thr Leu Ala SerLeu Val 35 40 45 Leu Gly Leu Phe Lys Pro Val Lys Gly Glu Ile Tyr Leu SerAsp Asn 50 55 60 Ala Val Leu Pro Ile Phe Gln His Pro Leu Thr Ser Phe AsnPro Asp 65 70 75 80 Trp Thr Ile Glu Thr Ser Leu Lys Glu Ala Leu Tyr TyrTyr Arg Gly 85 90 95 Leu Thr Asp Asn Thr Ala Gln Asp Gln Leu Leu Leu GlnHis Leu Ser 100 105 110 Thr Phe Glu Leu Asn Ala Gln Leu Leu Thr Lys LeuPro Ser Glu Val 115 120 125 Ser Gly Gly Gln Leu Gln Arg Phe Asn Val MetArg Ser Leu Leu Ala 130 135 140 Gln Pro Arg Val Leu Ile Cys Asp Glu IleThr Ser Asn Leu Asp Val 145 150 155 160 Ile Ala Glu Gln Asn Val Ile AsnIle Leu Lys Ala Gln Thr Ile Thr 165 170 175 Asn Leu Asn His Phe Ile ValIle Ser His Asp Leu Ser Val Leu Gln 180 185 190 Arg Leu Val Asn Arg IleIle Val Leu Lys Asp Gly Met Ile Val Asp 195 200 205 Asp Phe Ala Ile GluGlu Leu Phe Asn Val Asp Arg His Pro Tyr Thr 210 215 220 Lys Glu Leu ValGln Ala Phe Ser Tyr 225 230 1471 base pairs nucleic acid single linearother nucleic acid /desc = ”DNA (genomic) (p10c15)“ CDS 1..774 17 ATGATG AGT CTC ATT GAT ATA CAA AAT TTA ACA ATA AAG AAT ACT AGT 48 Met MetSer Leu Ile Asp Ile Gln Asn Leu Thr Ile Lys Asn Thr Ser 1 5 10 15 GAGAAA TCT CTT ATT AAA GGG ATT GAT TTG AAA ATT TTT AGT CAA CAG 96 Glu LysSer Leu Ile Lys Gly Ile Asp Leu Lys Ile Phe Ser Gln Gln 20 25 30 ATT AATGCC TTG ATT GGA GAG AGC GGC GCT GGA AAA AGT TTG ATT GCT 144 Ile Asn AlaLeu Ile Gly Glu Ser Gly Ala Gly Lys Ser Leu Ile Ala 35 40 45 AAA GCT TTACTT GAA TAT TTA CCA TTT GAT TTA AGC TGC ACG TAT GAT 192 Lys Ala Leu LeuGlu Tyr Leu Pro Phe Asp Leu Ser Cys Thr Tyr Asp 50 55 60 TCG TAC CAA TTTGAT GGG GAA AAT GTT AGT AGA TTG AGT CAA TAT TAT 240 Ser Tyr Gln Phe AspGly Glu Asn Val Ser Arg Leu Ser Gln Tyr Tyr 65 70 75 80 GGT CAT ACA ATTGGC TAT ATT TCT CAA AAT TAT GCA GAA AGT TTT AAC 288 Gly His Thr Ile GlyTyr Ile Ser Gln Asn Tyr Ala Glu Ser Phe Asn 85 90 95 GAC CAT ACT AAA TTAGGT AAA CAG TTA ACT GCG ATT TAT CGT AAG CAT 336 Asp His Thr Lys Leu GlyLys Gln Leu Thr Ala Ile Tyr Arg Lys His 100 105 110 TAT AAA GGT AGT AAAGAA GAG GCT TTG TCC AAA GTT GAT AAG GCT TTG 384 Tyr Lys Gly Ser Lys GluGlu Ala Leu Ser Lys Val Asp Lys Ala Leu 115 120 125 TCG TGG GTT AAT TTACAA AGC AAA GAT ATA TTA AAT AAA TAT AGT TTC 432 Ser Trp Val Asn Leu GlnSer Lys Asp Ile Leu Asn Lys Tyr Ser Phe 130 135 140 CAA CTT TCT GGG GGCCAA CTT GAA CGC GTA TAC ATA GCA AGC GTT CTC 480 Gln Leu Ser Gly Gly GlnLeu Glu Arg Val Tyr Ile Ala Ser Val Leu 145 150 155 160 ATG TTG GAG CCTAAA TTA ATC ATT GCA GAC GAA CCA GTT GCA TCA TTG 528 Met Leu Glu Pro LysLeu Ile Ile Ala Asp Glu Pro Val Ala Ser Leu 165 170 175 GAT GCT TTG AACGGT AAT CAA GTG ATG GAT TTA TTA CAG CAT ATT GTA 576 Asp Ala Leu Asn GlyAsn Gln Val Met Asp Leu Leu Gln His Ile Val 180 185 190 TTA GAA CAT GGTCAA ACA TTA TTT ATT ATC ACA CAT AAC TTA AGT CAT 624 Leu Glu His Gly GlnThr Leu Phe Ile Ile Thr His Asn Leu Ser His 195 200 205 GTA TTG AAA TATTGT CAG TAC ATT TAT GTT TTA AAA GAA GGT CAA ATC 672 Val Leu Lys Tyr CysGln Tyr Ile Tyr Val Leu Lys Glu Gly Gln Ile 210 215 220 ATT GAA CGA GGTAAT ATT AAT CAT TTC AAG TAT GAG CAT TTG CAT CCG 720 Ile Glu Arg Gly AsnIle Asn His Phe Lys Tyr Glu His Leu His Pro 225 230 235 240 TAT ACT GAACGT CTA ATT AAA TAT AGA ACA CAA TTA AAG AGG GAT TAC 768 Tyr Thr Glu ArgLeu Ile Lys Tyr Arg Thr Gln Leu Lys Arg Asp Tyr 245 250 255 TAT GATTGAGTTAAAA CATGTGACTT TTGGTTATAA TAAAAAGCAG ATGGTGCTAC 824 Tyr AspAAGATATCAA TATTACTATA CCTGATGGAG AAAATGTTGG TATTTTAGGC GAAAGTGGCT 884GTGGTAAAAG TACGCTCGCT TCATTGGTTC TTGGCTTGTT TAAACCTGTT AAAGGAGAGA 944TTTACTTAAG TGACAATGCT GTGTTACCGA TTTTCCAACA CCCTTTAACT AGCTTTAACC 1004CTGATTGGAC GATTGAGACC TCATTAAAAG AAGCGTTATA TTATTACAGA GGTCTAACTG 1064ATAATACTGC TCAGGATCAA TTATTATTAC AACATTTATC TACTTTTGAG TTAAACGCGC 1124AATTATTGAC TAAATTACCA AGCGAAGTGA GTGGCGGACA ATTACAAAGA TTTAATGTCA 1184TGCGTTCGTT ATTAGCACAG CCTCGCGTTT TAATATGTGA TGAGATAACT TCAAATTTAG 1244ATGTTATAGC TGAACAAAAT GTAATCAATA TATTAAAAGC GCAAACGATT ACGAACTTAA 1304ATCATTTTAT CGTTATTTCT CATGATTTAT CCGTGTTACA ACGCTTAGTT AATAGAATTA 1364TCGTTCTTAA GGATGGCATG ATAGTCGATG ATTTTGCAAT AGAGGAATTA TTTAATGTTG 1424ATAGACACCC TTATACAAAA GAATTAGTGC AAGCATTTTC ATATTAG 1471 258 amino acidsamino acid linear protein 18 Met Met Ser Leu Ile Asp Ile Gln Asn Leu ThrIle Lys Asn Thr Ser 1 5 10 15 Glu Lys Ser Leu Ile Lys Gly Ile Asp LeuLys Ile Phe Ser Gln Gln 20 25 30 Ile Asn Ala Leu Ile Gly Glu Ser Gly AlaGly Lys Ser Leu Ile Ala 35 40 45 Lys Ala Leu Leu Glu Tyr Leu Pro Phe AspLeu Ser Cys Thr Tyr Asp 50 55 60 Ser Tyr Gln Phe Asp Gly Glu Asn Val SerArg Leu Ser Gln Tyr Tyr 65 70 75 80 Gly His Thr Ile Gly Tyr Ile Ser GlnAsn Tyr Ala Glu Ser Phe Asn 85 90 95 Asp His Thr Lys Leu Gly Lys Gln LeuThr Ala Ile Tyr Arg Lys His 100 105 110 Tyr Lys Gly Ser Lys Glu Glu AlaLeu Ser Lys Val Asp Lys Ala Leu 115 120 125 Ser Trp Val Asn Leu Gln SerLys Asp Ile Leu Asn Lys Tyr Ser Phe 130 135 140 Gln Leu Ser Gly Gly GlnLeu Glu Arg Val Tyr Ile Ala Ser Val Leu 145 150 155 160 Met Leu Glu ProLys Leu Ile Ile Ala Asp Glu Pro Val Ala Ser Leu 165 170 175 Asp Ala LeuAsn Gly Asn Gln Val Met Asp Leu Leu Gln His Ile Val 180 185 190 Leu GluHis Gly Gln Thr Leu Phe Ile Ile Thr His Asn Leu Ser His 195 200 205 ValLeu Lys Tyr Cys Gln Tyr Ile Tyr Val Leu Lys Glu Gly Gln Ile 210 215 220Ile Glu Arg Gly Asn Ile Asn His Phe Lys Tyr Glu His Leu His Pro 225 230235 240 Tyr Thr Glu Arg Leu Ile Lys Tyr Arg Thr Gln Leu Lys Arg Asp Tyr245 250 255 Tyr Asp 1471 base pairs nucleic acid single linear othernucleic acid /desc = ”DNA (genomic) (p10c15)“ CDS 770..1468 19ATGATGAGTC TCATTGATAT ACAAAATTTA ACAATAAAGA ATACTAGTGA GAAATCTCTT 60ATTAAAGGGA TTGATTTGAA AATTTTTAGT CAACAGATTA ATGCCTTGAT TGGAGAGAGC 120GGCGCTGGAA AAAGTTTGAT TGCTAAAGCT TTACTTGAAT ATTTACCATT TGATTTAAGC 180TGCACGTATG ATTCGTACCA ATTTGATGGG GAAAATGTTA GTAGATTGAG TCAATATTAT 240GGTCATACAA TTGGCTATAT TTCTCAAAAT TATGCAGAAA GTTTTAACGA CCATACTAAA 300TTAGGTAAAC AGTTAACTGC GATTTATCGT AAGCATTATA AAGGTAGTAA AGAAGAGGCT 360TTGTCCAAAG TTGATAAGGC TTTGTCGTGG GTTAATTTAC AAAGCAAAGA TATATTAAAT 420AAATATAGTT TCCAACTTTC TGGGGGCCAA CTTGAACGCG TATACATAGC AAGCGTTCTC 480ATGTTGGAGC CTAAATTAAT CATTGCAGAC GAACCAGTTG CATCATTGGA TGCTTTGAAC 540GGTAATCAAG TGATGGATTT ATTACAGCAT ATTGTATTAG AACATGGTCA AACATTATTT 600ATTATCACAC ATAACTTAAG TCATGTATTG AAATATTGTC AGTACATTTA TGTTTTAAAA 660GAAGGTCAAA TCATTGAACG AGGTAATATT AATCATTTCA AGTATGAGCA TTTGCATCCG 720TATACTGAAC GTCTAATTAA ATATAGAACA CAATTAAAGA GGGATTACT ATG ATT 775 MetIle 1 GAG TTA AAA CAT GTG ACT TTT GGT TAT AAT AAA AAG CAG ATG GTG CTA823 Glu Leu Lys His Val Thr Phe Gly Tyr Asn Lys Lys Gln Met Val Leu 5 1015 CAA GAT ATC AAT ATT ACT ATA CCT GAT GGA GAA AAT GTT GGT ATT TTA 871Gln Asp Ile Asn Ile Thr Ile Pro Asp Gly Glu Asn Val Gly Ile Leu 20 25 30GGC GAA AGT GGC TGT GGT AAA AGT ACG CTC GCT TCA TTG GTT CTT GGC 919 GlyGlu Ser Gly Cys Gly Lys Ser Thr Leu Ala Ser Leu Val Leu Gly 35 40 45 50TTG TTT AAA CCT GTT AAA GGA GAG ATT TAC TTA AGT GAC AAT GCT GTG 967 LeuPhe Lys Pro Val Lys Gly Glu Ile Tyr Leu Ser Asp Asn Ala Val 55 60 65 TTACCG ATT TTC CAA CAC CCT TTA ACT AGC TTT AAC CCT GAT TGG ACG 1015 Leu ProIle Phe Gln His Pro Leu Thr Ser Phe Asn Pro Asp Trp Thr 70 75 80 ATT GAGACC TCA TTA AAA GAA GCG TTA TAT TAT TAC AGA GGT CTA ACT 1063 Ile Glu ThrSer Leu Lys Glu Ala Leu Tyr Tyr Tyr Arg Gly Leu Thr 85 90 95 GAT AAT ACTGCT CAG GAT CAA TTA TTA TTA CAA CAT TTA TCT ACT TTT 1111 Asp Asn Thr AlaGln Asp Gln Leu Leu Leu Gln His Leu Ser Thr Phe 100 105 110 GAG TTA AACGCG CAA TTA TTG ACT AAA TTA CCA AGC GAA GTG AGT GGC 1159 Glu Leu Asn AlaGln Leu Leu Thr Lys Leu Pro Ser Glu Val Ser Gly 115 120 125 130 GGA CAATTA CAA AGA TTT AAT GTC ATG CGT TCG TTA TTA GCA CAG CCT 1207 Gly Gln LeuGln Arg Phe Asn Val Met Arg Ser Leu Leu Ala Gln Pro 135 140 145 CGC GTTTTA ATA TGT GAT GAG ATA ACT TCA AAT TTA GAT GTT ATA GCT 1255 Arg Val LeuIle Cys Asp Glu Ile Thr Ser Asn Leu Asp Val Ile Ala 150 155 160 GAA CAAAAT GTA ATC AAT ATA TTA AAA GCG CAA ACG ATT ACG AAC TTA 1303 Glu Gln AsnVal Ile Asn Ile Leu Lys Ala Gln Thr Ile Thr Asn Leu 165 170 175 AAT CATTTT ATC GTT ATT TCT CAT GAT TTA TCC GTG TTA CAA CGC TTA 1351 Asn His PheIle Val Ile Ser His Asp Leu Ser Val Leu Gln Arg Leu 180 185 190 GTT AATAGA ATT ATC GTT CTT AAG GAT GGC ATG ATA GTC GAT GAT TTT 1399 Val Asn ArgIle Ile Val Leu Lys Asp Gly Met Ile Val Asp Asp Phe 195 200 205 210 GCAATA GAG GAA TTA TTT AAT GTT GAT AGA CAC CCT TAT ACA AAA GAA 1447 Ala IleGlu Glu Leu Phe Asn Val Asp Arg His Pro Tyr Thr Lys Glu 215 220 225 TTAGTG CAA GCA TTT TCA TAT TAG 1471 Leu Val Gln Ala Phe Ser Tyr 230 233amino acids amino acid linear protein 20 Met Ile Glu Leu Lys His Val ThrPhe Gly Tyr Asn Lys Lys Gln Met 1 5 10 15 Val Leu Gln Asp Ile Asn IleThr Ile Pro Asp Gly Glu Asn Val Gly 20 25 30 Ile Leu Gly Glu Ser Gly CysGly Lys Ser Thr Leu Ala Ser Leu Val 35 40 45 Leu Gly Leu Phe Lys Pro ValLys Gly Glu Ile Tyr Leu Ser Asp Asn 50 55 60 Ala Val Leu Pro Ile Phe GlnHis Pro Leu Thr Ser Phe Asn Pro Asp 65 70 75 80 Trp Thr Ile Glu Thr SerLeu Lys Glu Ala Leu Tyr Tyr Tyr Arg Gly 85 90 95 Leu Thr Asp Asn Thr AlaGln Asp Gln Leu Leu Leu Gln His Leu Ser 100 105 110 Thr Phe Glu Leu AsnAla Gln Leu Leu Thr Lys Leu Pro Ser Glu Val 115 120 125 Ser Gly Gly GlnLeu Gln Arg Phe Asn Val Met Arg Ser Leu Leu Ala 130 135 140 Gln Pro ArgVal Leu Ile Cys Asp Glu Ile Thr Ser Asn Leu Asp Val 145 150 155 160 IleAla Glu Gln Asn Val Ile Asn Ile Leu Lys Ala Gln Thr Ile Thr 165 170 175Asn Leu Asn His Phe Ile Val Ile Ser His Asp Leu Ser Val Leu Gln 180 185190 Arg Leu Val Asn Arg Ile Ile Val Leu Lys Asp Gly Met Ile Val Asp 195200 205 Asp Phe Ala Ile Glu Glu Leu Phe Asn Val Asp Arg His Pro Tyr Thr210 215 220 Lys Glu Leu Val Gln Ala Phe Ser Tyr 225 230 396 base pairsnucleic acid single linear other nucleic acid /desc = ”DNA (genomic)(p13b74)“ 21 AATCATAAAT TACGAATTAG ATATAACAAA AAAGAGCTTG GGACATTAAGTCCTTAAAGT 60 CTTAGGCAAT GTAAAAAAGC TGATTTCTAT TATTTATTTG ATAGAAATCAGCTTTTTTGA 120 TATGTATTTT ATAATGTACA GCTCGTTGAG CTGCTATTTT CCTTATATTAAGTGCCATTA 180 ATACAAAACC TAGCTCTCGT TTAACTTTAT TTATTCCCGA ACTGACATTCGAGTGAAACC 240 CAAAATAGCC TTCTAATCCA AAAACAGGCT CTACTCAATT TTCCTTTTGACTAATAGATT 300 TTTTCGTTTC TGGTTCCAAA GCCTTGAGGT TGTAGAATTC TTGAAGACGAAAAGGGCCCC 360 GTGATACGCC CTATTTTTAA AGGTTAATGT TCTGAA 396 1845 basepairs nucleic acid single linear other nucleic acid /desc = ”DNA(genomic) (p14c15)“ CDS 3..788 CDS 856..1842 22 GC ATC AAT GAA CAT CAGGAA TTA ACA ACT TTA GGC AGA GGT GGT TCT 47 Ile Asn Glu His Gln Glu LeuThr Thr Leu Gly Arg Gly Gly Ser 1 5 10 15 GAT ACG ACC GCT GTG GCA CTTGCT GTT AGT AAT CAA ATA CCT TGT GAA 95 Asp Thr Thr Ala Val Ala Leu AlaVal Ser Asn Gln Ile Pro Cys Glu 20 25 30 ATT TAT ACC GAC GTT GAT GGT GTGTAT GCC ACT GAC CCA AGA CTT TTA 143 Ile Tyr Thr Asp Val Asp Gly Val TyrAla Thr Asp Pro Arg Leu Leu 35 40 45 CCA AAG GCT AAA CGA CTA GAC ATC GTCTCA TAT GAA GAA ATG ATG GAA 191 Pro Lys Ala Lys Arg Leu Asp Ile Val SerTyr Glu Glu Met Met Glu 50 55 60 ATG AGC GCT TTA GGT GCT GGT GTA CTT GAAACA AGA AGT GTT GAA TTA 239 Met Ser Ala Leu Gly Ala Gly Val Leu Glu ThrArg Ser Val Glu Leu 65 70 75 GCT AAA AAC TAT AAT ATC CCT TTA TAT TTA GGAAAA ACT TTA TCG AAC 287 Ala Lys Asn Tyr Asn Ile Pro Leu Tyr Leu Gly LysThr Leu Ser Asn 80 85 90 95 GTG AAA GGA ACA TGG ATT ATG TCA AAT GAA GAAATA TTA GAG AAA AAA 335 Val Lys Gly Thr Trp Ile Met Ser Asn Glu Glu IleLeu Glu Lys Lys 100 105 110 GCA GTT ACT GGT GTG GCT TTG GAT AAA CAT ATGATG CAT GTA ACA ATT 383 Ala Val Thr Gly Val Ala Leu Asp Lys His Met MetHis Val Thr Ile 115 120 125 AGT TAT CCC CTA CCT GAC AAT CAG CTA CTT ACCCAA CTA TTT ACG GAA 431 Ser Tyr Pro Leu Pro Asp Asn Gln Leu Leu Thr GlnLeu Phe Thr Glu 130 135 140 CTT GAA GAA GGT GCT GTA AAT GTT GAT ATG ATTTCA CAA ATC GTC AAC 479 Leu Glu Glu Gly Ala Val Asn Val Asp Met Ile SerGln Ile Val Asn 145 150 155 TTG GAT GGG CTA CAA CTA TCC TTC ACG ATT AAAGAT AGT GAT TTT CAT 527 Leu Asp Gly Leu Gln Leu Ser Phe Thr Ile Lys AspSer Asp Phe His 160 165 170 175 CAA ATT TCT ATG ATT CTT GAA ACA TTA AAGAAT CAA TAT GAA GCA TTA 575 Gln Ile Ser Met Ile Leu Glu Thr Leu Lys AsnGln Tyr Glu Ala Leu 180 185 190 GCT TAT AAA ATC AAT GAG CAT TAT GTC AAAATT TCA TTA ATT GGC TCA 623 Ala Tyr Lys Ile Asn Glu His Tyr Val Lys IleSer Leu Ile Gly Ser 195 200 205 GGC ATG CGT GAT ATG TCA GGT GTG GCA TCAAAA GCA TTT TTG ACA TTA 671 Gly Met Arg Asp Met Ser Gly Val Ala Ser LysAla Phe Leu Thr Leu 210 215 220 ATT GAA AAT AAT ATA CCT TTC TAC CAA ACAACA ACA TCT GAA ATA AGT 719 Ile Glu Asn Asn Ile Pro Phe Tyr Gln Thr ThrThr Ser Glu Ile Ser 225 230 235 240 ATT TCA TAC GTC ATT GAT GAT TTT AATGGG CAA CAA GCG GTA GAA AAA 767 Ile Ser Tyr Val Ile Asp Asp Phe Asn GlyGln Gln Ala Val Glu Lys 245 250 255 CTA TAT GAC GCA TTT AAC ATTTAATGGTAAA ATGATTGTTA AAATATTCTA 818 Leu Tyr Asp Ala Phe Asn Ile 260AAAATTGGGA AATTATTATA AAATGGAGTG ACAAGTT ATG ACA AAG TTA GCA GTT 873 MetThr Lys Leu Ala Val 1 5 GTG GGT GCA ACA GGA TTA GTA GGT ACA AAA ATG TTGGAG ACA TTA AAT 921 Val Gly Ala Thr Gly Leu Val Gly Thr Lys Met Leu GluThr Leu Asn 10 15 20 CGT AAA AAT ATT CCT TTC GAT GAA TTA GTA TTA TTT TCATCA GCA CGT 969 Arg Lys Asn Ile Pro Phe Asp Glu Leu Val Leu Phe Ser SerAla Arg 25 30 35 TCT GCA GGG CAA GAA GTT GAA TTT CAA GGA AAA ACA TAT ACAGTT CAA 1017 Ser Ala Gly Gln Glu Val Glu Phe Gln Gly Lys Thr Tyr Thr ValGln 40 45 50 GAA TTA ACT GAT GCT CGT GCA AGT GAA CAT TTC GAT TAT GTA TTAATG 1065 Glu Leu Thr Asp Ala Arg Ala Ser Glu His Phe Asp Tyr Val Leu Met55 60 65 70 AGT GCT GGT GGC GGT ACA AGC GAA CAC TTT GCC CCA CTT TTT GAAAAA 1113 Ser Ala Gly Gly Gly Thr Ser Glu His Phe Ala Pro Leu Phe Glu Lys75 80 85 GCT GGT GCA ATC GTT ATA GAC AAT TCA AGT CAA TGG CGT ATG GCA GAA1161 Ala Gly Ala Ile Val Ile Asp Asn Ser Ser Gln Trp Arg Met Ala Glu 9095 100 GAT ATT GAT TTA ATC GTT CCG GAA GTC AAT GAA CCT ACA TTT ACA AGA1209 Asp Ile Asp Leu Ile Val Pro Glu Val Asn Glu Pro Thr Phe Thr Arg 105110 115 GGT ATC ATT GCC AAT CCA AAC TGC TCT ACG ATT CAA TCT GTT GTA CCT1257 Gly Ile Ile Ala Asn Pro Asn Cys Ser Thr Ile Gln Ser Val Val Pro 120125 130 CTA AAA GTA TTG CAA GAT GCT TAT GGT TTA AAA CGA GTG GCA TAT ACA1305 Leu Lys Val Leu Gln Asp Ala Tyr Gly Leu Lys Arg Val Ala Tyr Thr 135140 145 150 ACA TAT CAA GCT GTA TCA GGT TCA GGG ATG AAA GGT AAG AAA GATTTA 1353 Thr Tyr Gln Ala Val Ser Gly Ser Gly Met Lys Gly Lys Lys Asp Leu155 160 165 GCT GAA GGT GTA AAT GGT AAA GCA CCA GAA GCA TAT CCA CAT CCAATT 1401 Ala Glu Gly Val Asn Gly Lys Ala Pro Glu Ala Tyr Pro His Pro Ile170 175 180 TAT AAT AAT GTG TTA CCG CAT ATT GAT GTG TTT TTA GAA AAC GGATAT 1449 Tyr Asn Asn Val Leu Pro His Ile Asp Val Phe Leu Glu Asn Gly Tyr185 190 195 ACA AAA GAA GAA CAA AAA ATG ATT GAT GAG ACG AGA AAA ATT TTAAAT 1497 Thr Lys Glu Glu Gln Lys Met Ile Asp Glu Thr Arg Lys Ile Leu Asn200 205 210 GCG CCA GAC TTA AAA GTA ACA GCA ACA TGC GCA CGT GTG CCT GTTCAA 1545 Ala Pro Asp Leu Lys Val Thr Ala Thr Cys Ala Arg Val Pro Val Gln215 220 225 230 GAT AGT CAT AGT GTT GAA ATT GAT GTA ACG CTT GAC AAA GAAACA ACA 1593 Asp Ser His Ser Val Glu Ile Asp Val Thr Leu Asp Lys Glu ThrThr 235 240 245 GCA GAA GAT ATT AAA GCG TTA TTT GAT CAA GAT GAC CGC GTTGTT TTA 1641 Ala Glu Asp Ile Lys Ala Leu Phe Asp Gln Asp Asp Arg Val ValLeu 250 255 260 GTA GAC AAT CCA GAG AAC AAT GAA TAT CCA ATG GCA ATC AATTCT ACT 1689 Val Asp Asn Pro Glu Asn Asn Glu Tyr Pro Met Ala Ile Asn SerThr 265 270 275 AAT AAA GAT GAA GTG TTT GTT GGC CGT ATA CGT AGA GAT GATTCA TTA 1737 Asn Lys Asp Glu Val Phe Val Gly Arg Ile Arg Arg Asp Asp SerLeu 280 285 290 GAA AAT ACT TTC CAT GTA TGG TGT ACA TCA GAC AAT TTA TTAAAA GGT 1785 Glu Asn Thr Phe His Val Trp Cys Thr Ser Asp Asn Leu Leu LysGly 295 300 305 310 GCT GCA TTA AAT GCT GTA CAA GTA TTG GAA CAA GTT ATGCGT TTA AAA 1833 Ala Ala Leu Asn Ala Val Gln Val Leu Glu Gln Val Met ArgLeu Lys 315 320 325 GGA GCG AAT TAA 1845 Gly Ala Asn 262 amino acidsamino acid linear protein 23 Ile Asn Glu His Gln Glu Leu Thr Thr Leu GlyArg Gly Gly Ser Asp 1 5 10 15 Thr Thr Ala Val Ala Leu Ala Val Ser AsnGln Ile Pro Cys Glu Ile 20 25 30 Tyr Thr Asp Val Asp Gly Val Tyr Ala ThrAsp Pro Arg Leu Leu Pro 35 40 45 Lys Ala Lys Arg Leu Asp Ile Val Ser TyrGlu Glu Met Met Glu Met 50 55 60 Ser Ala Leu Gly Ala Gly Val Leu Glu ThrArg Ser Val Glu Leu Ala 65 70 75 80 Lys Asn Tyr Asn Ile Pro Leu Tyr LeuGly Lys Thr Leu Ser Asn Val 85 90 95 Lys Gly Thr Trp Ile Met Ser Asn GluGlu Ile Leu Glu Lys Lys Ala 100 105 110 Val Thr Gly Val Ala Leu Asp LysHis Met Met His Val Thr Ile Ser 115 120 125 Tyr Pro Leu Pro Asp Asn GlnLeu Leu Thr Gln Leu Phe Thr Glu Leu 130 135 140 Glu Glu Gly Ala Val AsnVal Asp Met Ile Ser Gln Ile Val Asn Leu 145 150 155 160 Asp Gly Leu GlnLeu Ser Phe Thr Ile Lys Asp Ser Asp Phe His Gln 165 170 175 Ile Ser MetIle Leu Glu Thr Leu Lys Asn Gln Tyr Glu Ala Leu Ala 180 185 190 Tyr LysIle Asn Glu His Tyr Val Lys Ile Ser Leu Ile Gly Ser Gly 195 200 205 MetArg Asp Met Ser Gly Val Ala Ser Lys Ala Phe Leu Thr Leu Ile 210 215 220Glu Asn Asn Ile Pro Phe Tyr Gln Thr Thr Thr Ser Glu Ile Ser Ile 225 230235 240 Ser Tyr Val Ile Asp Asp Phe Asn Gly Gln Gln Ala Val Glu Lys Leu245 250 255 Tyr Asp Ala Phe Asn Ile 260 329 amino acids amino acidlinear protein 24 Met Thr Lys Leu Ala Val Val Gly Ala Thr Gly Leu ValGly Thr Lys 1 5 10 15 Met Leu Glu Thr Leu Asn Arg Lys Asn Ile Pro PheAsp Glu Leu Val 20 25 30 Leu Phe Ser Ser Ala Arg Ser Ala Gly Gln Glu ValGlu Phe Gln Gly 35 40 45 Lys Thr Tyr Thr Val Gln Glu Leu Thr Asp Ala ArgAla Ser Glu His 50 55 60 Phe Asp Tyr Val Leu Met Ser Ala Gly Gly Gly ThrSer Glu His Phe 65 70 75 80 Ala Pro Leu Phe Glu Lys Ala Gly Ala Ile ValIle Asp Asn Ser Ser 85 90 95 Gln Trp Arg Met Ala Glu Asp Ile Asp Leu IleVal Pro Glu Val Asn 100 105 110 Glu Pro Thr Phe Thr Arg Gly Ile Ile AlaAsn Pro Asn Cys Ser Thr 115 120 125 Ile Gln Ser Val Val Pro Leu Lys ValLeu Gln Asp Ala Tyr Gly Leu 130 135 140 Lys Arg Val Ala Tyr Thr Thr TyrGln Ala Val Ser Gly Ser Gly Met 145 150 155 160 Lys Gly Lys Lys Asp LeuAla Glu Gly Val Asn Gly Lys Ala Pro Glu 165 170 175 Ala Tyr Pro His ProIle Tyr Asn Asn Val Leu Pro His Ile Asp Val 180 185 190 Phe Leu Glu AsnGly Tyr Thr Lys Glu Glu Gln Lys Met Ile Asp Glu 195 200 205 Thr Arg LysIle Leu Asn Ala Pro Asp Leu Lys Val Thr Ala Thr Cys 210 215 220 Ala ArgVal Pro Val Gln Asp Ser His Ser Val Glu Ile Asp Val Thr 225 230 235 240Leu Asp Lys Glu Thr Thr Ala Glu Asp Ile Lys Ala Leu Phe Asp Gln 245 250255 Asp Asp Arg Val Val Leu Val Asp Asn Pro Glu Asn Asn Glu Tyr Pro 260265 270 Met Ala Ile Asn Ser Thr Asn Lys Asp Glu Val Phe Val Gly Arg Ile275 280 285 Arg Arg Asp Asp Ser Leu Glu Asn Thr Phe His Val Trp Cys ThrSer 290 295 300 Asp Asn Leu Leu Lys Gly Ala Ala Leu Asn Ala Val Gln ValLeu Glu 305 310 315 320 Gln Val Met Arg Leu Lys Gly Ala Asn 325 927 basepairs nucleic acid single linear other nucleic acid /desc = ”DNA(genomic) (p13b26)“ CDS 1..924 25 ATG AAC GAA GCC GAC ATG CTG TTC TCTGTC ACT GTT CCC GGA AGC ACA 48 Met Asn Glu Ala Asp Met Leu Phe Ser ValThr Val Pro Gly Ser Thr 1 5 10 15 GCT AAC CTA GGC CCC GGC TTT GAT TCAGTC GGA ATG GCG CTC AGC AGA 96 Ala Asn Leu Gly Pro Gly Phe Asp Ser ValGly Met Ala Leu Ser Arg 20 25 30 TAT TTG AAG CTG ACC GTC TTT GAA AGC GACAAA TGG TCT TTT GAG GCT 144 Tyr Leu Lys Leu Thr Val Phe Glu Ser Asp LysTrp Ser Phe Glu Ala 35 40 45 GAA ACA GAA ACA GTC GCC GGA ATT CGC GGT ACAGAT AAC CTG ATC TAC 192 Glu Thr Glu Thr Val Ala Gly Ile Arg Gly Thr AspAsn Leu Ile Tyr 50 55 60 CAA GTG GCT AAA CGG ACC GCA GAT TTG TAC GGA AAAGAA ATG CCT CCT 240 Gln Val Ala Lys Arg Thr Ala Asp Leu Tyr Gly Lys GluMet Pro Pro 65 70 75 80 GTC CAT GTG AAG GTG TGG AGC GAC ATC CCG CTT GCACGC GGC CTT GGC 288 Val His Val Lys Val Trp Ser Asp Ile Pro Leu Ala ArgGly Leu Gly 85 90 95 AGC AGC GCC GCA GCG ATT GTA GCG GCC ATT GAA CTG GCTGAT GAA TTA 336 Ser Ser Ala Ala Ala Ile Val Ala Ala Ile Glu Leu Ala AspGlu Leu 100 105 110 TGC GGC TTA AAG CTG TCT GAA GCG GAC AAG CTG CAT TTAGCG AGT CTA 384 Cys Gly Leu Lys Leu Ser Glu Ala Asp Lys Leu His Leu AlaSer Leu 115 120 125 GAA GAA GGA CAC CCG GAC AAT GCT GGC GCT TCT CTC GTCGGC GGA CTT 432 Glu Glu Gly His Pro Asp Asn Ala Gly Ala Ser Leu Val GlyGly Leu 130 135 140 GTG ATC GGC CTG CAT GAG GAT GAC GAG ACC CAA ATG ATCCGC GTC CCG 480 Val Ile Gly Leu His Glu Asp Asp Glu Thr Gln Met Ile ArgVal Pro 145 150 155 160 AAT GCT GAC ATT GAC GTA GTC GTT GTC ATT CCT TTTTAT GAA GTG CTG 528 Asn Ala Asp Ile Asp Val Val Val Val Ile Pro Phe TyrGlu Val Leu 165 170 175 ACA AGA GAC GCG AGA GAC GTG CTT CCG AAG GAG TTTCCA TAT GCC GAT 576 Thr Arg Asp Ala Arg Asp Val Leu Pro Lys Glu Phe ProTyr Ala Asp 180 185 190 GCC GTA AAA GCA AGT GCT GTC AGC AAT ATC CTC ATTGCT GCG ATC ATG 624 Ala Val Lys Ala Ser Ala Val Ser Asn Ile Leu Ile AlaAla Ile Met 195 200 205 TCC AAG GAT TGG CCG CTT GTC GGG AAA ATC ATG AAGAAG GAT ATG TTC 672 Ser Lys Asp Trp Pro Leu Val Gly Lys Ile Met Lys LysAsp Met Phe 210 215 220 CAT CAG CCG TAC CGG GCG ATG CTT GTG CCT GAG CTGTCA AAA GTA GAG 720 His Gln Pro Tyr Arg Ala Met Leu Val Pro Glu Leu SerLys Val Glu 225 230 235 240 CAC GTC GCC GAG ATG AAG GGC GCA TAT GGA ACGGCT CTC AGC GGA GCA 768 His Val Ala Glu Met Lys Gly Ala Tyr Gly Thr AlaLeu Ser Gly Ala 245 250 255 GGC CCA ACG ATT CTC GTC ATG ACC GAA AAA GGAAAG GGA GAA GAG CTA 816 Gly Pro Thr Ile Leu Val Met Thr Glu Lys Gly LysGly Glu Glu Leu 260 265 270 AAA GAA CAG CTC GCG CTT CAT TTC CCT CAT TGTGAA GTA GAC GCT TTG 864 Lys Glu Gln Leu Ala Leu His Phe Pro His Cys GluVal Asp Ala Leu 275 280 285 ACC GTT CCG AAA GAG GGA AGT ATA ATA GAG CGAAAT CCT TTA TAT CAA 912 Thr Val Pro Lys Glu Gly Ser Ile Ile Glu Arg AsnPro Leu Tyr Gln 290 295 300 GTA AAA AGT GTA TAG 927 Val Lys Ser Val 305308 amino acids amino acid linear protein 26 Met Asn Glu Ala Asp Met LeuPhe Ser Val Thr Val Pro Gly Ser Thr 1 5 10 15 Ala Asn Leu Gly Pro GlyPhe Asp Ser Val Gly Met Ala Leu Ser Arg 20 25 30 Tyr Leu Lys Leu Thr ValPhe Glu Ser Asp Lys Trp Ser Phe Glu Ala 35 40 45 Glu Thr Glu Thr Val AlaGly Ile Arg Gly Thr Asp Asn Leu Ile Tyr 50 55 60 Gln Val Ala Lys Arg ThrAla Asp Leu Tyr Gly Lys Glu Met Pro Pro 65 70 75 80 Val His Val Lys ValTrp Ser Asp Ile Pro Leu Ala Arg Gly Leu Gly 85 90 95 Ser Ser Ala Ala AlaIle Val Ala Ala Ile Glu Leu Ala Asp Glu Leu 100 105 110 Cys Gly Leu LysLeu Ser Glu Ala Asp Lys Leu His Leu Ala Ser Leu 115 120 125 Glu Glu GlyHis Pro Asp Asn Ala Gly Ala Ser Leu Val Gly Gly Leu 130 135 140 Val IleGly Leu His Glu Asp Asp Glu Thr Gln Met Ile Arg Val Pro 145 150 155 160Asn Ala Asp Ile Asp Val Val Val Val Ile Pro Phe Tyr Glu Val Leu 165 170175 Thr Arg Asp Ala Arg Asp Val Leu Pro Lys Glu Phe Pro Tyr Ala Asp 180185 190 Ala Val Lys Ala Ser Ala Val Ser Asn Ile Leu Ile Ala Ala Ile Met195 200 205 Ser Lys Asp Trp Pro Leu Val Gly Lys Ile Met Lys Lys Asp MetPhe 210 215 220 His Gln Pro Tyr Arg Ala Met Leu Val Pro Glu Leu Ser LysVal Glu 225 230 235 240 His Val Ala Glu Met Lys Gly Ala Tyr Gly Thr AlaLeu Ser Gly Ala 245 250 255 Gly Pro Thr Ile Leu Val Met Thr Glu Lys GlyLys Gly Glu Glu Leu 260 265 270 Lys Glu Gln Leu Ala Leu His Phe Pro HisCys Glu Val Asp Ala Leu 275 280 285 Thr Val Pro Lys Glu Gly Ser Ile IleGlu Arg Asn Pro Leu Tyr Gln 290 295 300 Val Lys Ser Val 305 1320 basepairs nucleic acid single linear other nucleic acid /desc = ”DNA(genomic) (p7c18)“ CDS 1..1317 27 TTG TTC TTA CAC GGC ACA AGC AGA CAAAAT CAA CAT GGT CAT TTA GAA 48 Leu Phe Leu His Gly Thr Ser Arg Gln AsnGln His Gly His Leu Glu 1 5 10 15 ATC GGA GGT GTG GAT GCT CTC TAT TTAGCG GAG AAA TAT GGT ACA CCT 96 Ile Gly Gly Val Asp Ala Leu Tyr Leu AlaGlu Lys Tyr Gly Thr Pro 20 25 30 CTT TAC GTA TAT GAT GTG GCT TTA ATA CGTGAG CGT GCT AAA AGC TTT 144 Leu Tyr Val Tyr Asp Val Ala Leu Ile Arg GluArg Ala Lys Ser Phe 35 40 45 AAG CAG GCG TTT ATT TCT GCA GGG CTG AAA GCACAG GTG GCA TAT GCG 192 Lys Gln Ala Phe Ile Ser Ala Gly Leu Lys Ala GlnVal Ala Tyr Ala 50 55 60 AGC AAA GCA TTC TCA TCA GTC GCA ATG ATT CAG CTCGCT GAG GAA GAG 240 Ser Lys Ala Phe Ser Ser Val Ala Met Ile Gln Leu AlaGlu Glu Glu 65 70 75 80 GGA CTT TCT TTA GAT GTC GTA TCC GGA GGA GAG CTATAT ACG GCT GTT 288 Gly Leu Ser Leu Asp Val Val Ser Gly Gly Glu Leu TyrThr Ala Val 85 90 95 GCA GCA GGC TTT CCG GCA GAA CGC ATC CAC TTT CAT GGAAAC AAT AAG 336 Ala Ala Gly Phe Pro Ala Glu Arg Ile His Phe His Gly AsnAsn Lys 100 105 110 AGC AGG GAA GAA CTG CGG ATG GCG CTT GAG CAC CGC ATCGGC TGC ATT 384 Ser Arg Glu Glu Leu Arg Met Ala Leu Glu His Arg Ile GlyCys Ile 115 120 125 GTG GTG GAT AAT TTC TAT GAA ATC GCG CTT CTT GAA GACCTA TGT AAA 432 Val Val Asp Asn Phe Tyr Glu Ile Ala Leu Leu Glu Asp LeuCys Lys 130 135 140 GAA ACG GGT CAC TCC ATC GAT GTT CTT CTT CGG ATC ACGCCC GGA GTA 480 Glu Thr Gly His Ser Ile Asp Val Leu Leu Arg Ile Thr ProGly Val 145 150 155 160 GAA GCG CAT ACG CAT GAC TAC ATT ACA ACG GGC CAGGAA GAT TCA AAG 528 Glu Ala His Thr His Asp Tyr Ile Thr Thr Gly Gln GluAsp Ser Lys 165 170 175 TTT GGT TTC GAT CTT CAT AAC GGA CAA ACT GAA CGGGCC ATT GAA CAA 576 Phe Gly Phe Asp Leu His Asn Gly Gln Thr Glu Arg AlaIle Glu Gln 180 185 190 GTA TTA CAA TCG GAA CAC ATT CAG CTG CTG GGT GTCCAT TGC CAT ATC 624 Val Leu Gln Ser Glu His Ile Gln Leu Leu Gly Val HisCys His Ile 195 200 205 GGC TCG CAA ATC TTT GAT ACG GCC GGT TTT GTG TTAGCA GCG GAA AAA 672 Gly Ser Gln Ile Phe Asp Thr Ala Gly Phe Val Leu AlaAla Glu Lys 210 215 220 ATC TTC AAA AAA CTA GAC GAA TGG AGA GAT TCA TATTCA TTT GTA TCC 720 Ile Phe Lys Lys Leu Asp Glu Trp Arg Asp Ser Tyr SerPhe Val Ser 225 230 235 240 AAG GTG CTG AAT CTT GGA GGA GGT TTC GGC ATTCGT TAT ACG GAA GAT 768 Lys Val Leu Asn Leu Gly Gly Gly Phe Gly Ile ArgTyr Thr Glu Asp 245 250 255 GAT GAA CCG CTT CAT GCC ACT GAA TAC GTT GAAAAA ATT ATC GAA GCT 816 Asp Glu Pro Leu His Ala Thr Glu Tyr Val Glu LysIle Ile Glu Ala 260 265 270 GTG AAA GAA AAT GCT TCC CGT TAC GGT TTT GACATT CCG GAA ATT TGG 864 Val Lys Glu Asn Ala Ser Arg Tyr Gly Phe Asp IlePro Glu Ile Trp 275 280 285 ATC GAA CCG GGC CGT TCT CTC GTG GGA GAC GCAGGC ACA ACT CTT TAT 912 Ile Glu Pro Gly Arg Ser Leu Val Gly Asp Ala GlyThr Thr Leu Tyr 290 295 300 ACG GTT GGC TCT CAA AAA GAA GTG CCG GGT GTCCGC CAA TAT GTG GCT 960 Thr Val Gly Ser Gln Lys Glu Val Pro Gly Val ArgGln Tyr Val Ala 305 310 315 320 GTA GAC GGA GGC ATG AAC GAC AAT ATT CGTCCT GCG CTT TAC CAA GCT 1008 Val Asp Gly Gly Met Asn Asp Asn Ile Arg ProAla Leu Tyr Gln Ala 325 330 335 AAA TAT GAA GCT GCG GCA GCC AAC AGG ATCGGA GAA GCG CAT GAC AAA 1056 Lys Tyr Glu Ala Ala Ala Ala Asn Arg Ile GlyGlu Ala His Asp Lys 340 345 350 ACG GTA TCA ATT GCC GGA AAG TGC TGT GAAAGC GGA GAT ATG CTG ATT 1104 Thr Val Ser Ile Ala Gly Lys Cys Cys Glu SerGly Asp Met Leu Ile 355 360 365 TGG GAT ATT GAC CTG CCG GAA GTA AAA GAAGGC GAT CTT CTT GCC GTT 1152 Trp Asp Ile Asp Leu Pro Glu Val Lys Glu GlyAsp Leu Leu Ala Val 370 375 380 TTT TGT ACA GGC GCT TAT GGA TAC AGC ATGGCC AAC AAT TAT AAC CGT 1200 Phe Cys Thr Gly Ala Tyr Gly Tyr Ser Met AlaAsn Asn Tyr Asn Arg 385 390 395 400 ATT CCG AGA CCC GCC GTT GTA TTT GTCGAA AAC GGT GAG GCT CAT TTA 1248 Ile Pro Arg Pro Ala Val Val Phe Val GluAsn Gly Glu Ala His Leu 405 410 415 GTC GTG AAG CGA GAA ACA TAC GAA GATATT GTA AAA CTT GAT CTG CCA 1296 Val Val Lys Arg Glu Thr Tyr Glu Asp IleVal Lys Leu Asp Leu Pro 420 425 430 TTT AAA ACG GGT GTA AAG CAA TAA 1320Phe Lys Thr Gly Val Lys Gln 435 439 amino acids amino acid linearprotein 28 Leu Phe Leu His Gly Thr Ser Arg Gln Asn Gln His Gly His LeuGlu 1 5 10 15 Ile Gly Gly Val Asp Ala Leu Tyr Leu Ala Glu Lys Tyr GlyThr Pro 20 25 30 Leu Tyr Val Tyr Asp Val Ala Leu Ile Arg Glu Arg Ala LysSer Phe 35 40 45 Lys Gln Ala Phe Ile Ser Ala Gly Leu Lys Ala Gln Val AlaTyr Ala 50 55 60 Ser Lys Ala Phe Ser Ser Val Ala Met Ile Gln Leu Ala GluGlu Glu 65 70 75 80 Gly Leu Ser Leu Asp Val Val Ser Gly Gly Glu Leu TyrThr Ala Val 85 90 95 Ala Ala Gly Phe Pro Ala Glu Arg Ile His Phe His GlyAsn Asn Lys 100 105 110 Ser Arg Glu Glu Leu Arg Met Ala Leu Glu His ArgIle Gly Cys Ile 115 120 125 Val Val Asp Asn Phe Tyr Glu Ile Ala Leu LeuGlu Asp Leu Cys Lys 130 135 140 Glu Thr Gly His Ser Ile Asp Val Leu LeuArg Ile Thr Pro Gly Val 145 150 155 160 Glu Ala His Thr His Asp Tyr IleThr Thr Gly Gln Glu Asp Ser Lys 165 170 175 Phe Gly Phe Asp Leu His AsnGly Gln Thr Glu Arg Ala Ile Glu Gln 180 185 190 Val Leu Gln Ser Glu HisIle Gln Leu Leu Gly Val His Cys His Ile 195 200 205 Gly Ser Gln Ile PheAsp Thr Ala Gly Phe Val Leu Ala Ala Glu Lys 210 215 220 Ile Phe Lys LysLeu Asp Glu Trp Arg Asp Ser Tyr Ser Phe Val Ser 225 230 235 240 Lys ValLeu Asn Leu Gly Gly Gly Phe Gly Ile Arg Tyr Thr Glu Asp 245 250 255 AspGlu Pro Leu His Ala Thr Glu Tyr Val Glu Lys Ile Ile Glu Ala 260 265 270Val Lys Glu Asn Ala Ser Arg Tyr Gly Phe Asp Ile Pro Glu Ile Trp 275 280285 Ile Glu Pro Gly Arg Ser Leu Val Gly Asp Ala Gly Thr Thr Leu Tyr 290295 300 Thr Val Gly Ser Gln Lys Glu Val Pro Gly Val Arg Gln Tyr Val Ala305 310 315 320 Val Asp Gly Gly Met Asn Asp Asn Ile Arg Pro Ala Leu TyrGln Ala 325 330 335 Lys Tyr Glu Ala Ala Ala Ala Asn Arg Ile Gly Glu AlaHis Asp Lys 340 345 350 Thr Val Ser Ile Ala Gly Lys Cys Cys Glu Ser GlyAsp Met Leu Ile 355 360 365 Trp Asp Ile Asp Leu Pro Glu Val Lys Glu GlyAsp Leu Leu Ala Val 370 375 380 Phe Cys Thr Gly Ala Tyr Gly Tyr Ser MetAla Asn Asn Tyr Asn Arg 385 390 395 400 Ile Pro Arg Pro Ala Val Val PheVal Glu Asn Gly Glu Ala His Leu 405 410 415 Val Val Lys Arg Glu Thr TyrGlu Asp Ile Val Lys Leu Asp Leu Pro 420 425 430 Phe Lys Thr Gly Val LysGln 435 2190 base pairs nucleic acid single linear other nucleic acid/desc = ”DNA (genomic) (p15c31)“ CDS 1..2187 29 ATG TCT AAA TTT ATC GAACCA AGC GTT GAA GAA ATT AAA CTT GAA AAA 48 Met Ser Lys Phe Ile Glu ProSer Val Glu Glu Ile Lys Leu Glu Lys 1 5 10 15 GTA TAT CAA GAT ATG GGATTA AGT GAT CAA GAA TAT GAA AAA GTT TGC 96 Val Tyr Gln Asp Met Gly LeuSer Asp Gln Glu Tyr Glu Lys Val Cys 20 25 30 GAT ATT TTA GGC AGA CAA CCTAAC TTT ACA GAA ACA GGT ATC TTT TCT 144 Asp Ile Leu Gly Arg Gln Pro AsnPhe Thr Glu Thr Gly Ile Phe Ser 35 40 45 GTT ATG TGG AGT GAA CAT TGC TCTTAT AAA CAT TCT AAA CCG TTT TTA 192 Val Met Trp Ser Glu His Cys Ser TyrLys His Ser Lys Pro Phe Leu 50 55 60 AAG CAA TTT CCT ACG TCA GGT GAC CATGTG CTT ATG GGG CCT GGT GAA 240 Lys Gln Phe Pro Thr Ser Gly Asp His ValLeu Met Gly Pro Gly Glu 65 70 75 80 GGT GCA GGG GTA GTC GAT ATA GGT GATAAT CAA GCC GTA GTA TTT AAA 288 Gly Ala Gly Val Val Asp Ile Gly Asp AsnGln Ala Val Val Phe Lys 85 90 95 GTA GAG TCT CAC AAT CAT CCA TCA GCA ATTGAA CCA TAT CAA GGG GCT 336 Val Glu Ser His Asn His Pro Ser Ala Ile GluPro Tyr Gln Gly Ala 100 105 110 GCT ACA GGC GTT GGT GGA ATC ATT CGT GACATT GTC TCT ATT GGG GCT 384 Ala Thr Gly Val Gly Gly Ile Ile Arg Asp IleVal Ser Ile Gly Ala 115 120 125 AGA CCT ATT AAT TTG TTA AAC AGT CTT AGATTT GGA GAA TTA GAT AAT 432 Arg Pro Ile Asn Leu Leu Asn Ser Leu Arg PheGly Glu Leu Asp Asn 130 135 140 AAA CAA AAC CAA AGA TTA CTT AAA GGT GTTGTA AAG GGT ATC GGA GGT 480 Lys Gln Asn Gln Arg Leu Leu Lys Gly Val ValLys Gly Ile Gly Gly 145 150 155 160 TAT GGT AAC TGC ATT GGT ATT CCA ACAACT GCT GGT GAA ATC GAA TTT 528 Tyr Gly Asn Cys Ile Gly Ile Pro Thr ThrAla Gly Glu Ile Glu Phe 165 170 175 GAT GAA CGT TAT GAT GGC AAT CCA CTTGTT AAT GCA ATG TGT GTT GGT 576 Asp Glu Arg Tyr Asp Gly Asn Pro Leu ValAsn Ala Met Cys Val Gly 180 185 190 GTT ATC AAT CAC GAC ATG ATT CAA AAAGGC ACA GCA AAA GGT GTA GGT 624 Val Ile Asn His Asp Met Ile Gln Lys GlyThr Ala Lys Gly Val Gly 195 200 205 AAT TCG GTC ATT TAT GTT GGT TTG AAAACT GGT CGA GAT GGT ATT CAT 672 Asn Ser Val Ile Tyr Val Gly Leu Lys ThrGly Arg Asp Gly Ile His 210 215 220 GGT GCT ACT TTT GCA TCT GAA GAA TTGACG GAA GAA AGC GAA AGT AAA 720 Gly Ala Thr Phe Ala Ser Glu Glu Leu ThrGlu Glu Ser Glu Ser Lys 225 230 235 240 CGA CCT TCT GTA CAA ATC GGT GATCCA TTT GTA GGT AAA AAA TTA ATG 768 Arg Pro Ser Val Gln Ile Gly Asp ProPhe Val Gly Lys Lys Leu Met 245 250 255 GAA GCA ACA CTT GAA GCA ATT ACATTT GAT GAA TTA GTT GGT ATT CAA 816 Glu Ala Thr Leu Glu Ala Ile Thr PheAsp Glu Leu Val Gly Ile Gln 260 265 270 GAT ATG GGT GCT GCT GGT TTA ACATCT TCA TCG TCT GAA ATG GCG GCA 864 Asp Met Gly Ala Ala Gly Leu Thr SerSer Ser Ser Glu Met Ala Ala 275 280 285 AAA GGT GGT AGT GGG TTA CAT TTGAGA TTA GAA CAA GTG CCA ACA CGT 912 Lys Gly Gly Ser Gly Leu His Leu ArgLeu Glu Gln Val Pro Thr Arg 290 295 300 GAG CCA GGT ATT TCT CCT TAT GAAATG ATG CTT TCA GAA ACT CAA GAA 960 Glu Pro Gly Ile Ser Pro Tyr Glu MetMet Leu Ser Glu Thr Gln Glu 305 310 315 320 CGT ATG TTA CTA GTT GTT GAAAAA GGT ACT GAA CAA AAA TTC TTA GAT 1008 Arg Met Leu Leu Val Val Glu LysGly Thr Glu Gln Lys Phe Leu Asp 325 330 335 TTA TTT GAT AAG CAC GAA TTGGAT AGT GCT GTT ATA GGT GAA GTT ACA 1056 Leu Phe Asp Lys His Glu Leu AspSer Ala Val Ile Gly Glu Val Thr 340 345 350 GAT ACA AAT CGT TTT GTT TTAACA TAT GAT GAC GAA GTT TAT GCT GAC 1104 Asp Thr Asn Arg Phe Val Leu ThrTyr Asp Asp Glu Val Tyr Ala Asp 355 360 365 ATT CCA GTT GAA CCA CTA GCTGAT GAA GCA CCT GTA TAT ATT TTA GAA 1152 Ile Pro Val Glu Pro Leu Ala AspGlu Ala Pro Val Tyr Ile Leu Glu 370 375 380 GGA GAA GAA AAA GAT TAT AATACT TCT AAA AAT GAT TAT ACA CAC ATC 1200 Gly Glu Glu Lys Asp Tyr Asn ThrSer Lys Asn Asp Tyr Thr His Ile 385 390 395 400 GAT GTT AAA GAT ACT TTCTTT AAA TTA CTT AAG CAT CCG ACT ATA GCA 1248 Asp Val Lys Asp Thr Phe PheLys Leu Leu Lys His Pro Thr Ile Ala 405 410 415 TCT AAA CAC TAT TTA TATGAT CAA TAC GAC CAA CAA GTT GGT GCC AAT 1296 Ser Lys His Tyr Leu Tyr AspGln Tyr Asp Gln Gln Val Gly Ala Asn 420 425 430 ACG ATA ATT AAG CCA GGACTT CAA GCA TCG GTA GTA CGT GTG GAA GGC 1344 Thr Ile Ile Lys Pro Gly LeuGln Ala Ser Val Val Arg Val Glu Gly 435 440 445 ACA AAT AAG GCA ATT GCTTCA ACA ATT GAT GGT GAA GCG CGT TAT GTA 1392 Thr Asn Lys Ala Ile Ala SerThr Ile Asp Gly Glu Ala Arg Tyr Val 450 455 460 TAT AAC AAT CCA TAT GAAGGT GGA AAG ATG GTA GTA GCT GAA GCT TAT 1440 Tyr Asn Asn Pro Tyr Glu GlyGly Lys Met Val Val Ala Glu Ala Tyr 465 470 475 480 CGA AAT TTA ATT GCCGTG GGT GCA ACA CCA TTA GCA ATG ACA GAT TGT 1488 Arg Asn Leu Ile Ala ValGly Ala Thr Pro Leu Ala Met Thr Asp Cys 485 490 495 TTA AAT TAT GGT TCTCCT GAA AAG AAA GAA ATC TAT CAA CAG TTG ATA 1536 Leu Asn Tyr Gly Ser ProGlu Lys Lys Glu Ile Tyr Gln Gln Leu Ile 500 505 510 GAT TCA ACG AAA GGTATG GCA GAA GCA TGC GAC ATT CTT AAG ACA CCA 1584 Asp Ser Thr Lys Gly MetAla Glu Ala Cys Asp Ile Leu Lys Thr Pro 515 520 525 GTA GTT TCT GGT AATGTA TCT TTA TAT AAC GAA ACG AAA GGT ACT TCT 1632 Val Val Ser Gly Asn ValSer Leu Tyr Asn Glu Thr Lys Gly Thr Ser 530 535 540 ATT TTC CCA ACA CCAGTT GTT GGA ATG GTA GGT TTG ATT GAA AAT GTA 1680 Ile Phe Pro Thr Pro ValVal Gly Met Val Gly Leu Ile Glu Asn Val 545 550 555 560 AAT TAT TTA AATGAT TTT GAA CCT CAA GTT GGA GAT AAA TTA TAT TTA 1728 Asn Tyr Leu Asn AspPhe Glu Pro Gln Val Gly Asp Lys Leu Tyr Leu 565 570 575 ATC GGT GAT ACTAAG GAC GAC TTT GGT GGT AGT CAA CTT GAA AAG TTA 1776 Ile Gly Asp Thr LysAsp Asp Phe Gly Gly Ser Gln Leu Glu Lys Leu 580 585 590 ATT TAT GGC AAAGTT AAT CAT GAA TTT GAG TCA TTA GAT TTG AGT TCA 1824 Ile Tyr Gly Lys ValAsn His Glu Phe Glu Ser Leu Asp Leu Ser Ser 595 600 605 GAA GTT GAA AAAGGT GAA TCA ATC AAG ACC GCT ATT CGT GAA GGA CTA 1872 Glu Val Glu Lys GlyGlu Ser Ile Lys Thr Ala Ile Arg Glu Gly Leu 610 615 620 TTA TCA CAT GTTCAA ACA GTT GGT AAA GGT GGC TTA CTG ATT ACC TTA 1920 Leu Ser His Val GlnThr Val Gly Lys Gly Gly Leu Leu Ile Thr Leu 625 630 635 640 GCT AAA CTAAGT GCG CAT TAC GGT TTA GGA TTA AAA TCT TCA ATA GAT 1968 Ala Lys Leu SerAla His Tyr Gly Leu Gly Leu Lys Ser Ser Ile Asp 645 650 655 ATA ACA AATGCA CAA TTG TTT AGT GAG ACG CAA GGC CGA TAT GTT GTT 2016 Ile Thr Asn AlaGln Leu Phe Ser Glu Thr Gln Gly Arg Tyr Val Val 660 665 670 TCT GTT AAATCA GGT AAA ACT TTA AAT ATT GAT AAT GCA ATA GAA ATT 2064 Ser Val Lys SerGly Lys Thr Leu Asn Ile Asp Asn Ala Ile Glu Ile 675 680 685 GGA CTT TTAACA GAT AGT GAT AAT TYC AAG GTA ACA ACA CCA TAT ACA 2112 Gly Leu Leu ThrAsp Ser Asp Asn Xaa Lys Val Thr Thr Pro Tyr Thr 690 695 700 GAG ATT AGTGAA AAT GTT TCA GAT ATT AAA CAA ATA TGG GAA GGG GCA 2160 Glu Ile Ser GluAsn Val Ser Asp Ile Lys Gln Ile Trp Glu Gly Ala 705 710 715 720 ATT GCTCAA TGT TTA ACT ACT CAG GAT TAA 2190 Ile Ala Gln Cys Leu Thr Thr Gln Asp725 729 amino acids amino acid linear protein 30 Met Ser Lys Phe Ile GluPro Ser Val Glu Glu Ile Lys Leu Glu Lys 1 5 10 15 Val Tyr Gln Asp MetGly Leu Ser Asp Gln Glu Tyr Glu Lys Val Cys 20 25 30 Asp Ile Leu Gly ArgGln Pro Asn Phe Thr Glu Thr Gly Ile Phe Ser 35 40 45 Val Met Trp Ser GluHis Cys Ser Tyr Lys His Ser Lys Pro Phe Leu 50 55 60 Lys Gln Phe Pro ThrSer Gly Asp His Val Leu Met Gly Pro Gly Glu 65 70 75 80 Gly Ala Gly ValVal Asp Ile Gly Asp Asn Gln Ala Val Val Phe Lys 85 90 95 Val Glu Ser HisAsn His Pro Ser Ala Ile Glu Pro Tyr Gln Gly Ala 100 105 110 Ala Thr GlyVal Gly Gly Ile Ile Arg Asp Ile Val Ser Ile Gly Ala 115 120 125 Arg ProIle Asn Leu Leu Asn Ser Leu Arg Phe Gly Glu Leu Asp Asn 130 135 140 LysGln Asn Gln Arg Leu Leu Lys Gly Val Val Lys Gly Ile Gly Gly 145 150 155160 Tyr Gly Asn Cys Ile Gly Ile Pro Thr Thr Ala Gly Glu Ile Glu Phe 165170 175 Asp Glu Arg Tyr Asp Gly Asn Pro Leu Val Asn Ala Met Cys Val Gly180 185 190 Val Ile Asn His Asp Met Ile Gln Lys Gly Thr Ala Lys Gly ValGly 195 200 205 Asn Ser Val Ile Tyr Val Gly Leu Lys Thr Gly Arg Asp GlyIle His 210 215 220 Gly Ala Thr Phe Ala Ser Glu Glu Leu Thr Glu Glu SerGlu Ser Lys 225 230 235 240 Arg Pro Ser Val Gln Ile Gly Asp Pro Phe ValGly Lys Lys Leu Met 245 250 255 Glu Ala Thr Leu Glu Ala Ile Thr Phe AspGlu Leu Val Gly Ile Gln 260 265 270 Asp Met Gly Ala Ala Gly Leu Thr SerSer Ser Ser Glu Met Ala Ala 275 280 285 Lys Gly Gly Ser Gly Leu His LeuArg Leu Glu Gln Val Pro Thr Arg 290 295 300 Glu Pro Gly Ile Ser Pro TyrGlu Met Met Leu Ser Glu Thr Gln Glu 305 310 315 320 Arg Met Leu Leu ValVal Glu Lys Gly Thr Glu Gln Lys Phe Leu Asp 325 330 335 Leu Phe Asp LysHis Glu Leu Asp Ser Ala Val Ile Gly Glu Val Thr 340 345 350 Asp Thr AsnArg Phe Val Leu Thr Tyr Asp Asp Glu Val Tyr Ala Asp 355 360 365 Ile ProVal Glu Pro Leu Ala Asp Glu Ala Pro Val Tyr Ile Leu Glu 370 375 380 GlyGlu Glu Lys Asp Tyr Asn Thr Ser Lys Asn Asp Tyr Thr His Ile 385 390 395400 Asp Val Lys Asp Thr Phe Phe Lys Leu Leu Lys His Pro Thr Ile Ala 405410 415 Ser Lys His Tyr Leu Tyr Asp Gln Tyr Asp Gln Gln Val Gly Ala Asn420 425 430 Thr Ile Ile Lys Pro Gly Leu Gln Ala Ser Val Val Arg Val GluGly 435 440 445 Thr Asn Lys Ala Ile Ala Ser Thr Ile Asp Gly Glu Ala ArgTyr Val 450 455 460 Tyr Asn Asn Pro Tyr Glu Gly Gly Lys Met Val Val AlaGlu Ala Tyr 465 470 475 480 Arg Asn Leu Ile Ala Val Gly Ala Thr Pro LeuAla Met Thr Asp Cys 485 490 495 Leu Asn Tyr Gly Ser Pro Glu Lys Lys GluIle Tyr Gln Gln Leu Ile 500 505 510 Asp Ser Thr Lys Gly Met Ala Glu AlaCys Asp Ile Leu Lys Thr Pro 515 520 525 Val Val Ser Gly Asn Val Ser LeuTyr Asn Glu Thr Lys Gly Thr Ser 530 535 540 Ile Phe Pro Thr Pro Val ValGly Met Val Gly Leu Ile Glu Asn Val 545 550 555 560 Asn Tyr Leu Asn AspPhe Glu Pro Gln Val Gly Asp Lys Leu Tyr Leu 565 570 575 Ile Gly Asp ThrLys Asp Asp Phe Gly Gly Ser Gln Leu Glu Lys Leu 580 585 590 Ile Tyr GlyLys Val Asn His Glu Phe Glu Ser Leu Asp Leu Ser Ser 595 600 605 Glu ValGlu Lys Gly Glu Ser Ile Lys Thr Ala Ile Arg Glu Gly Leu 610 615 620 LeuSer His Val Gln Thr Val Gly Lys Gly Gly Leu Leu Ile Thr Leu 625 630 635640 Ala Lys Leu Ser Ala His Tyr Gly Leu Gly Leu Lys Ser Ser Ile Asp 645650 655 Ile Thr Asn Ala Gln Leu Phe Ser Glu Thr Gln Gly Arg Tyr Val Val660 665 670 Ser Val Lys Ser Gly Lys Thr Leu Asn Ile Asp Asn Ala Ile GluIle 675 680 685 Gly Leu Leu Thr Asp Ser Asp Asn Xaa Lys Val Thr Thr ProTyr Thr 690 695 700 Glu Ile Ser Glu Asn Val Ser Asp Ile Lys Gln Ile TrpGlu Gly Ala 705 710 715 720 Ile Ala Gln Cys Leu Thr Thr Gln Asp 725 729base pairs nucleic acid single linear other nucleic acid /desc = ”DNA(genomic) (p10b18/p6b18)“ CDS 1..726 31 ATG ACT AAA TTA TTT ATA CCT TATATT ATG GGC AAT AAA GAT TTG ATT 48 Met Thr Lys Leu Phe Ile Pro Tyr IleMet Gly Asn Lys Asp Leu Ile 1 5 10 15 GAA AAT GCA ACA TTG TTG AGT GAAAAT GGT GCA GAT ATA ATT GAA ATT 96 Glu Asn Ala Thr Leu Leu Ser Glu AsnGly Ala Asp Ile Ile Glu Ile 20 25 30 GGA GTA CCT TTC TCT GAT CCG GTT GCTGAT GGT CCA GTT ATC ATG GAA 144 Gly Val Pro Phe Ser Asp Pro Val Ala AspGly Pro Val Ile Met Glu 35 40 45 GCA GGT CAA CAA GCG ATT AAA CAA GGC ATCACG ATA GAT TAT ATT TTC 192 Ala Gly Gln Gln Ala Ile Lys Gln Gly Ile ThrIle Asp Tyr Ile Phe 50 55 60 AAT CAA TTA GAA AAA CAT GGT GAT CAA ATT AAGTGT AAC TAT GTA TTA 240 Asn Gln Leu Glu Lys His Gly Asp Gln Ile Lys CysAsn Tyr Val Leu 65 70 75 80 ATG ACG TAT TAT AAT ATT ATT TGT CAT TAT GGAGAA CAA GCG TTT TTT 288 Met Thr Tyr Tyr Asn Ile Ile Cys His Tyr Gly GluGln Ala Phe Phe 85 90 95 GAA AAA TGT CGA GAT ACT GGT GTC TAC GGC TTA ATTATT CCT GAT TTA 336 Glu Lys Cys Arg Asp Thr Gly Val Tyr Gly Leu Ile IlePro Asp Leu 100 105 110 CCA TAT GAA TTA TCG CAG CGT TTA AAA CAA CAA TTTAGT CAC TAT GGC 384 Pro Tyr Glu Leu Ser Gln Arg Leu Lys Gln Gln Phe SerHis Tyr Gly 115 120 125 GTC AAA ATC ATA TCG TTA GTT GCG ATG ACT ACT GATGAC AAA CGT ATA 432 Val Lys Ile Ile Ser Leu Val Ala Met Thr Thr Asp AspLys Arg Ile 130 135 140 AAA GAT ATC GTA TCC CAT GCG GAA GGC TTT ATT TATACT GTG ACG ATG 480 Lys Asp Ile Val Ser His Ala Glu Gly Phe Ile Tyr ThrVal Thr Met 145 150 155 160 AAT GCG ACA ACA GGG CAA AAC GGT GCG TTT CATCCA GAA TTA AAA CGA 528 Asn Ala Thr Thr Gly Gln Asn Gly Ala Phe His ProGlu Leu Lys Arg 165 170 175 AAA ATT GAG TCA ATT AAA GCG ATA GCC AAT GTGCCA GTT GTC GCA GGA 576 Lys Ile Glu Ser Ile Lys Ala Ile Ala Asn Val ProVal Val Ala Gly 180 185 190 TTT GGT ATA AGA ACA CCA CAA CAT GTT GCA GATATA AAA GAG GTT GCA 624 Phe Gly Ile Arg Thr Pro Gln His Val Ala Asp IleLys Glu Val Ala 195 200 205 GAT GGC ATT GTC ATT GGT AGC GAA ATC GTT AAGCGA TTT AAA TCT AAC 672 Asp Gly Ile Val Ile Gly Ser Glu Ile Val Lys ArgPhe Lys Ser Asn 210 215 220 ACG CGT GAG GAA ATC ATT AAA TAT TTA CAA TCTATC CAA CAA ACA TTG 720 Thr Arg Glu Glu Ile Ile Lys Tyr Leu Gln Ser IleGln Gln Thr Leu 225 230 235 240 AAT AAT TAA 729 Asn Asn 242 amino acidsamino acid linear protein 32 Met Thr Lys Leu Phe Ile Pro Tyr Ile Met GlyAsn Lys Asp Leu Ile 1 5 10 15 Glu Asn Ala Thr Leu Leu Ser Glu Asn GlyAla Asp Ile Ile Glu Ile 20 25 30 Gly Val Pro Phe Ser Asp Pro Val Ala AspGly Pro Val Ile Met Glu 35 40 45 Ala Gly Gln Gln Ala Ile Lys Gln Gly IleThr Ile Asp Tyr Ile Phe 50 55 60 Asn Gln Leu Glu Lys His Gly Asp Gln IleLys Cys Asn Tyr Val Leu 65 70 75 80 Met Thr Tyr Tyr Asn Ile Ile Cys HisTyr Gly Glu Gln Ala Phe Phe 85 90 95 Glu Lys Cys Arg Asp Thr Gly Val TyrGly Leu Ile Ile Pro Asp Leu 100 105 110 Pro Tyr Glu Leu Ser Gln Arg LeuLys Gln Gln Phe Ser His Tyr Gly 115 120 125 Val Lys Ile Ile Ser Leu ValAla Met Thr Thr Asp Asp Lys Arg Ile 130 135 140 Lys Asp Ile Val Ser HisAla Glu Gly Phe Ile Tyr Thr Val Thr Met 145 150 155 160 Asn Ala Thr ThrGly Gln Asn Gly Ala Phe His Pro Glu Leu Lys Arg 165 170 175 Lys Ile GluSer Ile Lys Ala Ile Ala Asn Val Pro Val Val Ala Gly 180 185 190 Phe GlyIle Arg Thr Pro Gln His Val Ala Asp Ile Lys Glu Val Ala 195 200 205 AspGly Ile Val Ile Gly Ser Glu Ile Val Lys Arg Phe Lys Ser Asn 210 215 220Thr Arg Glu Glu Ile Ile Lys Tyr Leu Gln Ser Ile Gln Gln Thr Leu 225 230235 240 Asn Asn 1215 base pairs nucleic acid single linear other nucleicacid /desc = ”DNA (genomic) (p10b66)“ CDS 1..1212 33 ATG AAT AAA CAA ATACAA ACA GAA GCA GAT GAA TTA GGT TTC TTT GGT 48 Met Asn Lys Gln Ile GlnThr Glu Ala Asp Glu Leu Gly Phe Phe Gly 1 5 10 15 GAA TAT GGA GGG CAATAT GTT CCA GAA ACA TTA ATG CCA GCA ATT ATT 96 Glu Tyr Gly Gly Gln TyrVal Pro Glu Thr Leu Met Pro Ala Ile Ile 20 25 30 GAG TTG AAA AAA GCT TATAAA GAG GCA AAA GCA GAC CCA GAG TTT CAA 144 Glu Leu Lys Lys Ala Tyr LysGlu Ala Lys Ala Asp Pro Glu Phe Gln 35 40 45 AGA GAA CTG GAA TAC TAT TTATCA GAG TAT GTA GGA CGC GCG ACA CCA 192 Arg Glu Leu Glu Tyr Tyr Leu SerGlu Tyr Val Gly Arg Ala Thr Pro 50 55 60 CTT ACA TAT GCT GCA TCA TAT ACTGAA AGC CTA GGT GGC GCT AAA ATA 240 Leu Thr Tyr Ala Ala Ser Tyr Thr GluSer Leu Gly Gly Ala Lys Ile 65 70 75 80 TAT TTG AAA CGA GAG GAT CTA AATCAT ACA GGC GCC CAT AAA ATT AAT 288 Tyr Leu Lys Arg Glu Asp Leu Asn HisThr Gly Ala His Lys Ile Asn 85 90 95 AAT GCG TTA GGT CAA GCG TTG CTT GCTAAA AGA ATG GGC AAG AAG AAG 336 Asn Ala Leu Gly Gln Ala Leu Leu Ala LysArg Met Gly Lys Lys Lys 100 105 110 CTT GTT GCT GAA ACT GGT GCG GGT CAACAT GGT GTA GCT AGT GCT ACG 384 Leu Val Ala Glu Thr Gly Ala Gly Gln HisGly Val Ala Ser Ala Thr 115 120 125 GTT GCT GCA TTA TTT GAT ATG GAA CTTGTT GTC TTT ATG GGA AGT GAA 432 Val Ala Ala Leu Phe Asp Met Glu Leu ValVal Phe Met Gly Ser Glu 130 135 140 GAT ATT AAA AGA CAA CAA CTT AAT GTATTT AGA ATG GAA TTA CTT GGT 480 Asp Ile Lys Arg Gln Gln Leu Asn Val PheArg Met Glu Leu Leu Gly 145 150 155 160 GCA AAG GTT GTG GCA GTT GAA GATGGT CAA GGG ACT TTA TCG GAT GCA 528 Ala Lys Val Val Ala Val Glu Asp GlyGln Gly Thr Leu Ser Asp Ala 165 170 175 GTT AAT AAA GCA TTG CAA TAT TGGGTA AGT CAT GTA GAT GAT ACA CAT 576 Val Asn Lys Ala Leu Gln Tyr Trp ValSer His Val Asp Asp Thr His 180 185 190 TAT TTA TTA GGT TCT GCA TTA GGTCCA GAC CCG TTC CCA ACG ATT GTT 624 Tyr Leu Leu Gly Ser Ala Leu Gly ProAsp Pro Phe Pro Thr Ile Val 195 200 205 AGA GAT TTT CAG AGT GTG ATT GGTAAA GAA ATA AAA TCA CAG ATA TTG 672 Arg Asp Phe Gln Ser Val Ile Gly LysGlu Ile Lys Ser Gln Ile Leu 210 215 220 AAG AAA GAA GGT CGA CTT CCG GATGCA ATT GTA GCA TGT ATC GGT GGT 720 Lys Lys Glu Gly Arg Leu Pro Asp AlaIle Val Ala Cys Ile Gly Gly 225 230 235 240 GGC TCA AAT GCA ATC GGT ACATTT TAT CCA TTT ATT AAA GAT GAT GTT 768 Gly Ser Asn Ala Ile Gly Thr PheTyr Pro Phe Ile Lys Asp Asp Val 245 250 255 GCA TTA TAC GGT GTT GAA GCCGCA GGT CAA GGC GAT GAT ACT GAT AAA 816 Ala Leu Tyr Gly Val Glu Ala AlaGly Gln Gly Asp Asp Thr Asp Lys 260 265 270 CAT GCA CTT GCA ATT GGC AAAGGA TCA CCT GGC GTA TTA CAT GGT ACT 864 His Ala Leu Ala Ile Gly Lys GlySer Pro Gly Val Leu His Gly Thr 275 280 285 AAA ATG TAT TTA ATT CAA GATGAA GAT GGG CAA GTG CAA CTA GCA CAT 912 Lys Met Tyr Leu Ile Gln Asp GluAsp Gly Gln Val Gln Leu Ala His 290 295 300 TCT ATT TCA GCA GGA CTT GATTAT CCT GGT ATT GGA CCA GAA CAT TCT 960 Ser Ile Ser Ala Gly Leu Asp TyrPro Gly Ile Gly Pro Glu His Ser 305 310 315 320 TAT TAC CAC GAC ATT GGTAGA GTA ACT TTT GAA AAT GCT AGT GAT ACA 1008 Tyr Tyr His Asp Ile Gly ArgVal Thr Phe Glu Asn Ala Ser Asp Thr 325 330 335 CAA GCA ATG AAT GCT TTAATC AAC TTT ACA AAA CAT GAA GGT ATT ATA 1056 Gln Ala Met Asn Ala Leu IleAsn Phe Thr Lys His Glu Gly Ile Ile 340 345 350 CCT GCA ATT GAA AGT GCACAT GCA CTG AGT TAT GTT GAA AGA CTA GCG 1104 Pro Ala Ile Glu Ser Ala HisAla Leu Ser Tyr Val Glu Arg Leu Ala 355 360 365 CCT ACG ATG TCG AAA GAAGAT ATT ATT GTA GTA ACT ATT TCT GGA CGT 1152 Pro Thr Met Ser Lys Glu AspIle Ile Val Val Thr Ile Ser Gly Arg 370 375 380 GGC GAT AAA GAT ATG GAAACA ATT AGA CAA TAT ATG GTA GAG CGA GGT 1200 Gly Asp Lys Asp Met Glu ThrIle Arg Gln Tyr Met Val Glu Arg Gly 385 390 395 400 CTT GCA AAT GAC TAA1215 Leu Ala Asn Asp 404 amino acids amino acid linear protein 34 MetAsn Lys Gln Ile Gln Thr Glu Ala Asp Glu Leu Gly Phe Phe Gly 1 5 10 15Glu Tyr Gly Gly Gln Tyr Val Pro Glu Thr Leu Met Pro Ala Ile Ile 20 25 30Glu Leu Lys Lys Ala Tyr Lys Glu Ala Lys Ala Asp Pro Glu Phe Gln 35 40 45Arg Glu Leu Glu Tyr Tyr Leu Ser Glu Tyr Val Gly Arg Ala Thr Pro 50 55 60Leu Thr Tyr Ala Ala Ser Tyr Thr Glu Ser Leu Gly Gly Ala Lys Ile 65 70 7580 Tyr Leu Lys Arg Glu Asp Leu Asn His Thr Gly Ala His Lys Ile Asn 85 9095 Asn Ala Leu Gly Gln Ala Leu Leu Ala Lys Arg Met Gly Lys Lys Lys 100105 110 Leu Val Ala Glu Thr Gly Ala Gly Gln His Gly Val Ala Ser Ala Thr115 120 125 Val Ala Ala Leu Phe Asp Met Glu Leu Val Val Phe Met Gly SerGlu 130 135 140 Asp Ile Lys Arg Gln Gln Leu Asn Val Phe Arg Met Glu LeuLeu Gly 145 150 155 160 Ala Lys Val Val Ala Val Glu Asp Gly Gln Gly ThrLeu Ser Asp Ala 165 170 175 Val Asn Lys Ala Leu Gln Tyr Trp Val Ser HisVal Asp Asp Thr His 180 185 190 Tyr Leu Leu Gly Ser Ala Leu Gly Pro AspPro Phe Pro Thr Ile Val 195 200 205 Arg Asp Phe Gln Ser Val Ile Gly LysGlu Ile Lys Ser Gln Ile Leu 210 215 220 Lys Lys Glu Gly Arg Leu Pro AspAla Ile Val Ala Cys Ile Gly Gly 225 230 235 240 Gly Ser Asn Ala Ile GlyThr Phe Tyr Pro Phe Ile Lys Asp Asp Val 245 250 255 Ala Leu Tyr Gly ValGlu Ala Ala Gly Gln Gly Asp Asp Thr Asp Lys 260 265 270 His Ala Leu AlaIle Gly Lys Gly Ser Pro Gly Val Leu His Gly Thr 275 280 285 Lys Met TyrLeu Ile Gln Asp Glu Asp Gly Gln Val Gln Leu Ala His 290 295 300 Ser IleSer Ala Gly Leu Asp Tyr Pro Gly Ile Gly Pro Glu His Ser 305 310 315 320Tyr Tyr His Asp Ile Gly Arg Val Thr Phe Glu Asn Ala Ser Asp Thr 325 330335 Gln Ala Met Asn Ala Leu Ile Asn Phe Thr Lys His Glu Gly Ile Ile 340345 350 Pro Ala Ile Glu Ser Ala His Ala Leu Ser Tyr Val Glu Arg Leu Ala355 360 365 Pro Thr Met Ser Lys Glu Asp Ile Ile Val Val Thr Ile Ser GlyArg 370 375 380 Gly Asp Lys Asp Met Glu Thr Ile Arg Gln Tyr Met Val GluArg Gly 385 390 395 400 Leu Ala Asn Asp 1008 base pairs nucleic acidsingle linear other nucleic acid /desc = ”DNA (genomic) (p10c34)“ CDS1..1005 35 ATG AAA ACC ATG ACA TTA CTA ACA AGA ATA AAA ACT GAA ACT ATATTA 48 Met Lys Thr Met Thr Leu Leu Thr Arg Ile Lys Thr Glu Thr Ile Leu 15 10 15 CTT GAA AGC GAC ATT AAA GAG CTA ATC GAT ATA CTT ATT TCT CCT AGT96 Leu Glu Ser Asp Ile Lys Glu Leu Ile Asp Ile Leu Ile Ser Pro Ser 20 2530 ATT GGA ACT GAT ATT AAA TAT GAA TTA CTT AGT TCC TAT TCG GAG CGA 144Ile Gly Thr Asp Ile Lys Tyr Glu Leu Leu Ser Ser Tyr Ser Glu Arg 35 40 45GAA ATC CAA CAA CAA GAA TTA ACA TAT ATT GTA CGT AGC TTA ATT AAT 192 GluIle Gln Gln Gln Glu Leu Thr Tyr Ile Val Arg Ser Leu Ile Asn 50 55 60 ACAATG TAT CCA CAT CAA CCA TGT TAT GAA GGG GCT ATG TGT GTG TGC 240 Thr MetTyr Pro His Gln Pro Cys Tyr Glu Gly Ala Met Cys Val Cys 65 70 75 80 GGCACA GGT GGT GAC AAG TCA AAT AGT TTC AAC ATT TCA ACG ACT GTT 288 Gly ThrGly Gly Asp Lys Ser Asn Ser Phe Asn Ile Ser Thr Thr Val 85 90 95 GCT TTTGTT GTA GCA AGT GCT GGC GTA AAA GTT ATA AAA CAT GGT AAT 336 Ala Phe ValVal Ala Ser Ala Gly Val Lys Val Ile Lys His Gly Asn 100 105 110 AAA AGTATT ACC TCA AAT TCA GGT AGT ACG GAT TTG TTA AAT CAA ATG 384 Lys Ser IleThr Ser Asn Ser Gly Ser Thr Asp Leu Leu Asn Gln Met 115 120 125 AAC ATACAA ACA ACA ACT GTT GAT GAT ACA CCT AAC CAA TTA AAT GAN 432 Asn Ile GlnThr Thr Thr Val Asp Asp Thr Pro Asn Gln Leu Asn Xaa 130 135 140 AAA GACCTT GTA TTC ATT GGT GCA ACT GAA TCA TAT CCA ATC ATG AAG 480 Lys Asp LeuVal Phe Ile Gly Ala Thr Glu Ser Tyr Pro Ile Met Lys 145 150 155 160 TATATG CAA CCA GTT AGA AAA ATG ATT GGA AAG CCT ACA ATA TTA AAC 528 Tyr MetGln Pro Val Arg Lys Met Ile Gly Lys Pro Thr Ile Leu Asn 165 170 175 CTTGTG GGT CCA TTA ATT AAT CCA TAT CAC TTA ACG TAT CAA ATG GTA 576 Leu ValGly Pro Leu Ile Asn Pro Tyr His Leu Thr Tyr Gln Met Val 180 185 190 GGCGTC TTT GAT CCT ACA AAG TTA AAG TTA GTT GCT AAA ACG ATT AAA 624 Gly ValPhe Asp Pro Thr Lys Leu Lys Leu Val Ala Lys Thr Ile Lys 195 200 205 GATTTA GGT AGA AAA CGT GCA ATC GTT TTA CAT GGT GCA AAT GGT ATG 672 Asp LeuGly Arg Lys Arg Ala Ile Val Leu His Gly Ala Asn Gly Met 210 215 220 GATGAA GCA ACA CTA TCT GGT GAT AAT TTG ATA TAT GAA TTG ACT GAA 720 Asp GluAla Thr Leu Ser Gly Asp Asn Leu Ile Tyr Glu Leu Thr Glu 225 230 235 240GAT GGA GAA ATC AAA AAT TAC ACA TTA AAT GCG ACT GAT TAT GGT TTG 768 AspGly Glu Ile Lys Asn Tyr Thr Leu Asn Ala Thr Asp Tyr Gly Leu 245 250 255AAA CAT GCG CCG AAT AGT GAT TTT AAA GGC GGT TCA CCT GAA GAA AAT 816 LysHis Ala Pro Asn Ser Asp Phe Lys Gly Gly Ser Pro Glu Glu Asn 260 265 270TTA GCA ATC TCC CTT AAT ATC TTG AAT GGT AAA GAT CAG TCA RGT CGA 864 LeuAla Ile Ser Leu Asn Ile Leu Asn Gly Lys Asp Gln Ser Xaa Arg 275 280 285CGT GAT GTT GTC TTA CTA AAT GCG GGT TTA AGC CTT TAT GTT GCA GAG 912 ArgAsp Val Val Leu Leu Asn Ala Gly Leu Ser Leu Tyr Val Ala Glu 290 295 300AAA RTG GAT ACC ATC GCA GAA GGC ATA GAA CTT GCA ACT ACA TTG ATT 960 LysXaa Asp Thr Ile Ala Glu Gly Ile Glu Leu Ala Thr Thr Leu Ile 305 310 315320 GAT AAT GGT GAA GCA TTG GAA AAA TAC CAT CAA ATG AGA GGT GAA 1005 AspAsn Gly Glu Ala Leu Glu Lys Tyr His Gln Met Arg Gly Glu 325 330 335 TAA1008 335 amino acids amino acid linear protein 36 Met Lys Thr Met ThrLeu Leu Thr Arg Ile Lys Thr Glu Thr Ile Leu 1 5 10 15 Leu Glu Ser AspIle Lys Glu Leu Ile Asp Ile Leu Ile Ser Pro Ser 20 25 30 Ile Gly Thr AspIle Lys Tyr Glu Leu Leu Ser Ser Tyr Ser Glu Arg 35 40 45 Glu Ile Gln GlnGln Glu Leu Thr Tyr Ile Val Arg Ser Leu Ile Asn 50 55 60 Thr Met Tyr ProHis Gln Pro Cys Tyr Glu Gly Ala Met Cys Val Cys 65 70 75 80 Gly Thr GlyGly Asp Lys Ser Asn Ser Phe Asn Ile Ser Thr Thr Val 85 90 95 Ala Phe ValVal Ala Ser Ala Gly Val Lys Val Ile Lys His Gly Asn 100 105 110 Lys SerIle Thr Ser Asn Ser Gly Ser Thr Asp Leu Leu Asn Gln Met 115 120 125 AsnIle Gln Thr Thr Thr Val Asp Asp Thr Pro Asn Gln Leu Asn Xaa 130 135 140Lys Asp Leu Val Phe Ile Gly Ala Thr Glu Ser Tyr Pro Ile Met Lys 145 150155 160 Tyr Met Gln Pro Val Arg Lys Met Ile Gly Lys Pro Thr Ile Leu Asn165 170 175 Leu Val Gly Pro Leu Ile Asn Pro Tyr His Leu Thr Tyr Gln MetVal 180 185 190 Gly Val Phe Asp Pro Thr Lys Leu Lys Leu Val Ala Lys ThrIle Lys 195 200 205 Asp Leu Gly Arg Lys Arg Ala Ile Val Leu His Gly AlaAsn Gly Met 210 215 220 Asp Glu Ala Thr Leu Ser Gly Asp Asn Leu Ile TyrGlu Leu Thr Glu 225 230 235 240 Asp Gly Glu Ile Lys Asn Tyr Thr Leu AsnAla Thr Asp Tyr Gly Leu 245 250 255 Lys His Ala Pro Asn Ser Asp Phe LysGly Gly Ser Pro Glu Glu Asn 260 265 270 Leu Ala Ile Ser Leu Asn Ile LeuAsn Gly Lys Asp Gln Ser Xaa Arg 275 280 285 Arg Asp Val Val Leu Leu AsnAla Gly Leu Ser Leu Tyr Val Ala Glu 290 295 300 Lys Xaa Asp Thr Ile AlaGlu Gly Ile Glu Leu Ala Thr Thr Leu Ile 305 310 315 320 Asp Asn Gly GluAla Leu Glu Lys Tyr His Gln Met Arg Gly Glu 325 330 335 1254 base pairsnucleic acid single linear other nucleic acid /desc = ”DNA (genomic)(p4c27)“ CDS 1..1251 37 ATG GCG GAA ATT AAG GTA CCT GAA TTA GCA GAA TCAATC TCA GAA GGA 48 Met Ala Glu Ile Lys Val Pro Glu Leu Ala Glu Ser IleSer Glu Gly 1 5 10 15 ACA ATA GCC CAA TGG TTA AAG CAG CCT GGT GAC TATGTA GAA CAG GGT 96 Thr Ile Ala Gln Trp Leu Lys Gln Pro Gly Asp Tyr ValGlu Gln Gly 20 25 30 GAA TAT CTG CTT GAA CTA GAA ACG GAT AAA GTG AAT GTTGAA TTG ACA 144 Glu Tyr Leu Leu Glu Leu Glu Thr Asp Lys Val Asn Val GluLeu Thr 35 40 45 GCA GAA GAA TCG GGT GTA CTT CAA GAG GTA TTG AAA GAT TCGGGT GAT 192 Ala Glu Glu Ser Gly Val Leu Gln Glu Val Leu Lys Asp Ser GlyAsp 50 55 60 ACC GTC CAG GTC GGA GAA ATT ATC GGT ACG ATT TCA GAA GGC GCGGGT 240 Thr Val Gln Val Gly Glu Ile Ile Gly Thr Ile Ser Glu Gly Ala Gly65 70 75 80 GAA AGT TCT GCC CCT GCT CCT ACA GAG AAA ACA GAA AGC AAG GAAAGC 288 Glu Ser Ser Ala Pro Ala Pro Thr Glu Lys Thr Glu Ser Lys Glu Ser85 90 95 GTA AAA GAA GAG AAA CAG GCT GAA CCA GCT GCA CAA GAG GTG AGC GAG336 Val Lys Glu Glu Lys Gln Ala Glu Pro Ala Ala Gln Glu Val Ser Glu 100105 110 GAA GCA CAA TCT GAA GCA AAA TCA AGA ACG ATC GCT TCT CCG TCG GCC384 Glu Ala Gln Ser Glu Ala Lys Ser Arg Thr Ile Ala Ser Pro Ser Ala 115120 125 CGT AAG CTT GCG AGA GAA AAA GGA ATT GAC CTG TCT CAA GTT CCA ACT432 Arg Lys Leu Ala Arg Glu Lys Gly Ile Asp Leu Ser Gln Val Pro Thr 130135 140 GGA GAT CCG CTT GGA AGA GTG CGC AAG CAG GAT GTC GAA GCG TAC GAA480 Gly Asp Pro Leu Gly Arg Val Arg Lys Gln Asp Val Glu Ala Tyr Glu 145150 155 160 AAA CCG GCA TCA AAA CCG GCT CCT CAG CAA AAG CAG CAG CCT CAGGCT 528 Lys Pro Ala Ser Lys Pro Ala Pro Gln Gln Lys Gln Gln Pro Gln Ala165 170 175 CAA AAA GCA CAG CAA AGC TTT GAC AAA CCT GTT GAA GTG CAA AAAATG 576 Gln Lys Ala Gln Gln Ser Phe Asp Lys Pro Val Glu Val Gln Lys Met180 185 190 TCA CGC CGC AGA CAA ACG ATT GCA AAA CGC CTT GTA GAG GTA CAGCAA 624 Ser Arg Arg Arg Gln Thr Ile Ala Lys Arg Leu Val Glu Val Gln Gln195 200 205 ACA TCA GCG ATG CTG ACT ACA TTT AAT GAA GTG GAC ATG ACG GCTGTC 672 Thr Ser Ala Met Leu Thr Thr Phe Asn Glu Val Asp Met Thr Ala Val210 215 220 ATG AAT CTC AGA AAA CGC CGC AAA GAT CAA TTT TTT GAG CAA AATGAA 720 Met Asn Leu Arg Lys Arg Arg Lys Asp Gln Phe Phe Glu Gln Asn Glu225 230 235 240 GTG AAG CTC GGC TTT ATG TCT TTC TTC ACG AAA GCG GTC GTGGCT GCA 768 Val Lys Leu Gly Phe Met Ser Phe Phe Thr Lys Ala Val Val AlaAla 245 250 255 TTG AAA AAA TAT CCG CTG TTG AAT GCA GAA ATT CAA GGC GATGAG TTG 816 Leu Lys Lys Tyr Pro Leu Leu Asn Ala Glu Ile Gln Gly Asp GluLeu 260 265 270 ATC GTT AAA AAA TTC TAC GAT ATC GGA ATC GCT GTT GCT GCTGTA GAA 864 Ile Val Lys Lys Phe Tyr Asp Ile Gly Ile Ala Val Ala Ala ValGlu 275 280 285 GGT CTT GTC GTT CCG GTT GTA CGG GAT GCG GAT CGC CTG ACATTT GCA 912 Gly Leu Val Val Pro Val Val Arg Asp Ala Asp Arg Leu Thr PheAla 290 295 300 GGA ATC GAA AAA GAG ATC GGC GAG CTT GCG AAA AAA GCA AGAAAC AAT 960 Gly Ile Glu Lys Glu Ile Gly Glu Leu Ala Lys Lys Ala Arg AsnAsn 305 310 315 320 AAA TTA ACC CTT AGC GAG CTT GAG GGA GGC TCC TTC ACGATT ACA AAC 1008 Lys Leu Thr Leu Ser Glu Leu Glu Gly Gly Ser Phe Thr IleThr Asn 325 330 335 GGA GGG ACT TTC GGT TCA TTG ATG TCA ACT CCA ATT TTAAAC AGC CCG 1056 Gly Gly Thr Phe Gly Ser Leu Met Ser Thr Pro Ile Leu AsnSer Pro 340 345 350 CAA GTC GGT ATA CTG GGC ATG CAT AAG ATT CAG CTG CGCCCT GTA GCC 1104 Gln Val Gly Ile Leu Gly Met His Lys Ile Gln Leu Arg ProVal Ala 355 360 365 ATT GAT GAA GAG CGT TTC GAA AAC CGT CCG ATG ATG TATATC GCT TTA 1152 Ile Asp Glu Glu Arg Phe Glu Asn Arg Pro Met Met Tyr IleAla Leu 370 375 380 TCT TAT GAT CAC CGA ATT GTA GAC GGT AAA GAA GCG GTTGGT TTC CTC 1200 Ser Tyr Asp His Arg Ile Val Asp Gly Lys Glu Ala Val GlyPhe Leu 385 390 395 400 GTG ACA ATC AAA AAT TTA CTG GAA GAT CCT GAA CAGCTT TTA TTA GAA 1248 Val Thr Ile Lys Asn Leu Leu Glu Asp Pro Glu Gln LeuLeu Leu Glu 405 410 415 GGA TAA 1254 Gly 417 amino acids amino acidlinear protein 38 Met Ala Glu Ile Lys Val Pro Glu Leu Ala Glu Ser IleSer Glu Gly 1 5 10 15 Thr Ile Ala Gln Trp Leu Lys Gln Pro Gly Asp TyrVal Glu Gln Gly 20 25 30 Glu Tyr Leu Leu Glu Leu Glu Thr Asp Lys Val AsnVal Glu Leu Thr 35 40 45 Ala Glu Glu Ser Gly Val Leu Gln Glu Val Leu LysAsp Ser Gly Asp 50 55 60 Thr Val Gln Val Gly Glu Ile Ile Gly Thr Ile SerGlu Gly Ala Gly 65 70 75 80 Glu Ser Ser Ala Pro Ala Pro Thr Glu Lys ThrGlu Ser Lys Glu Ser 85 90 95 Val Lys Glu Glu Lys Gln Ala Glu Pro Ala AlaGln Glu Val Ser Glu 100 105 110 Glu Ala Gln Ser Glu Ala Lys Ser Arg ThrIle Ala Ser Pro Ser Ala 115 120 125 Arg Lys Leu Ala Arg Glu Lys Gly IleAsp Leu Ser Gln Val Pro Thr 130 135 140 Gly Asp Pro Leu Gly Arg Val ArgLys Gln Asp Val Glu Ala Tyr Glu 145 150 155 160 Lys Pro Ala Ser Lys ProAla Pro Gln Gln Lys Gln Gln Pro Gln Ala 165 170 175 Gln Lys Ala Gln GlnSer Phe Asp Lys Pro Val Glu Val Gln Lys Met 180 185 190 Ser Arg Arg ArgGln Thr Ile Ala Lys Arg Leu Val Glu Val Gln Gln 195 200 205 Thr Ser AlaMet Leu Thr Thr Phe Asn Glu Val Asp Met Thr Ala Val 210 215 220 Met AsnLeu Arg Lys Arg Arg Lys Asp Gln Phe Phe Glu Gln Asn Glu 225 230 235 240Val Lys Leu Gly Phe Met Ser Phe Phe Thr Lys Ala Val Val Ala Ala 245 250255 Leu Lys Lys Tyr Pro Leu Leu Asn Ala Glu Ile Gln Gly Asp Glu Leu 260265 270 Ile Val Lys Lys Phe Tyr Asp Ile Gly Ile Ala Val Ala Ala Val Glu275 280 285 Gly Leu Val Val Pro Val Val Arg Asp Ala Asp Arg Leu Thr PheAla 290 295 300 Gly Ile Glu Lys Glu Ile Gly Glu Leu Ala Lys Lys Ala ArgAsn Asn 305 310 315 320 Lys Leu Thr Leu Ser Glu Leu Glu Gly Gly Ser PheThr Ile Thr Asn 325 330 335 Gly Gly Thr Phe Gly Ser Leu Met Ser Thr ProIle Leu Asn Ser Pro 340 345 350 Gln Val Gly Ile Leu Gly Met His Lys IleGln Leu Arg Pro Val Ala 355 360 365 Ile Asp Glu Glu Arg Phe Glu Asn ArgPro Met Met Tyr Ile Ala Leu 370 375 380 Ser Tyr Asp His Arg Ile Val AspGly Lys Glu Ala Val Gly Phe Leu 385 390 395 400 Val Thr Ile Lys Asn LeuLeu Glu Asp Pro Glu Gln Leu Leu Leu Glu 405 410 415 Gly 1254 base pairsnucleic acid single linear other nucleic acid /desc = ”DNA (genomic)(p4c52)“ CDS 1..1251 39 ATG GCG GAA ATT AAG GTA CCT GAA TTA GCA GAA TCAATC TCA GAA GGA 48 Met Ala Glu Ile Lys Val Pro Glu Leu Ala Glu Ser IleSer Glu Gly 1 5 10 15 ACA ATA GCC CAA TGG TTA AAG CAG CCT GGT GAC TATGTA GAA CAG GGT 96 Thr Ile Ala Gln Trp Leu Lys Gln Pro Gly Asp Tyr ValGlu Gln Gly 20 25 30 GAA TAT CTG CTT GAA CTA GAA ACG GAT AAA GTG AAT GTTGAA TTG ACA 144 Glu Tyr Leu Leu Glu Leu Glu Thr Asp Lys Val Asn Val GluLeu Thr 35 40 45 GCA GAA GAA TCG GGT GTA CTT CAA GAG GTA TTG AAA GAT TCGGGT GAT 192 Ala Glu Glu Ser Gly Val Leu Gln Glu Val Leu Lys Asp Ser GlyAsp 50 55 60 ACC GTC CAG GTC GGA GAA ATT ATC GGT ACG ATT TCA GAA GGC GCGGGT 240 Thr Val Gln Val Gly Glu Ile Ile Gly Thr Ile Ser Glu Gly Ala Gly65 70 75 80 GAA AGT TCT GCC CCT GCT CCT ACA GAG AAA ACA GAA AGC AAG GAAAGC 288 Glu Ser Ser Ala Pro Ala Pro Thr Glu Lys Thr Glu Ser Lys Glu Ser85 90 95 GTA AAA GAA GAG AAA CAG GCT GAA CCA GCT GCA CAA GAG GTG AGC GAG336 Val Lys Glu Glu Lys Gln Ala Glu Pro Ala Ala Gln Glu Val Ser Glu 100105 110 GAA GCA CAA TCT GAA GCA AAA TCA AGA ACG ATC GCT TCT CCG TCG GCC384 Glu Ala Gln Ser Glu Ala Lys Ser Arg Thr Ile Ala Ser Pro Ser Ala 115120 125 CGT AAG CTT GCG AGA GAA AAA GGA ATT GAC CTG TCT CAA GTT CCA ACT432 Arg Lys Leu Ala Arg Glu Lys Gly Ile Asp Leu Ser Gln Val Pro Thr 130135 140 GGA GAT CCG CTT GGA AGA GTG CGC AAG CAG GAT GTC GAA GCG TAC GAA480 Gly Asp Pro Leu Gly Arg Val Arg Lys Gln Asp Val Glu Ala Tyr Glu 145150 155 160 AAA CCG GCA TCA AAA CCG GCT CCT CAG CAA AAG CAG CAG CCT CAGGCT 528 Lys Pro Ala Ser Lys Pro Ala Pro Gln Gln Lys Gln Gln Pro Gln Ala165 170 175 CAA AAA GCA CAG CAA AGC TTT GAC AAA CCT GTT GAA GTG CAA AAAATG 576 Gln Lys Ala Gln Gln Ser Phe Asp Lys Pro Val Glu Val Gln Lys Met180 185 190 TCA CGC CGC AGA CAA ACG ATT GCA AAA CGC CTT GTA GAG GTA CAGCAA 624 Ser Arg Arg Arg Gln Thr Ile Ala Lys Arg Leu Val Glu Val Gln Gln195 200 205 ACA TCA GCG ATG CTG ACT ACA TTT AAT GAA GTG GAC ATG ACG GCTGTC 672 Thr Ser Ala Met Leu Thr Thr Phe Asn Glu Val Asp Met Thr Ala Val210 215 220 ATG AAT CTC AGA AAA CGC CGC AAA GAT CAA TTT TTT GAG CAA AATGAA 720 Met Asn Leu Arg Lys Arg Arg Lys Asp Gln Phe Phe Glu Gln Asn Glu225 230 235 240 GTG AAG CTC GGC TTT ATG TCT TTC TTC ACG AAA GCG GTC GTGGCT GCA 768 Val Lys Leu Gly Phe Met Ser Phe Phe Thr Lys Ala Val Val AlaAla 245 250 255 TTG AAA AAA TAT CCG CTG TTG AAT GCA GAA ATT CAA GGC GATGAG TTG 816 Leu Lys Lys Tyr Pro Leu Leu Asn Ala Glu Ile Gln Gly Asp GluLeu 260 265 270 ATC GTT AAA AAA TTC TAC GAT ATC GGA ATC GCT GTT GCT GCTGTA GAA 864 Ile Val Lys Lys Phe Tyr Asp Ile Gly Ile Ala Val Ala Ala ValGlu 275 280 285 GGT CTT GTC GTT CCG GTT GTA CGG GAT GCG GAT CGC CTG ACATTT GCA 912 Gly Leu Val Val Pro Val Val Arg Asp Ala Asp Arg Leu Thr PheAla 290 295 300 GGA ATC GAA AAA GAG ATC GGC GAG CTT GCG AAA AAA GCA AGAAAC AAT 960 Gly Ile Glu Lys Glu Ile Gly Glu Leu Ala Lys Lys Ala Arg AsnAsn 305 310 315 320 AAA TTA ACC CTT AGC GAG CTT GAG GGA GGC TCC TTC ACGATT ACA AAC 1008 Lys Leu Thr Leu Ser Glu Leu Glu Gly Gly Ser Phe Thr IleThr Asn 325 330 335 GGA GGG ACT TTC GGT TCA TTG ATG TCA ACT CCA ATT TTAAAC AGC CCG 1056 Gly Gly Thr Phe Gly Ser Leu Met Ser Thr Pro Ile Leu AsnSer Pro 340 345 350 CAA GTC GGT ATA CTG GGC ATG CAT AAG ATT CAG CTG CGCCCT GTA GCC 1104 Gln Val Gly Ile Leu Gly Met His Lys Ile Gln Leu Arg ProVal Ala 355 360 365 ATT GAT GAA GAG CGT TTC GAA AAC CGT CCG ATG ATG TATATC GCT TTA 1152 Ile Asp Glu Glu Arg Phe Glu Asn Arg Pro Met Met Tyr IleAla Leu 370 375 380 TCT TAT GAT CAC CGA ATT GTA GAC GGT AAA GAA GCG GTTGGT TTC CTC 1200 Ser Tyr Asp His Arg Ile Val Asp Gly Lys Glu Ala Val GlyPhe Leu 385 390 395 400 GTG ACA ATC AAA AAT TTA CTG GAA GAT CCT GAA CAGCTT TTA TTA GAA 1248 Val Thr Ile Lys Asn Leu Leu Glu Asp Pro Glu Gln LeuLeu Leu Glu 405 410 415 GGA TAA 1254 Gly 417 amino acids amino acidlinear protein 40 Met Ala Glu Ile Lys Val Pro Glu Leu Ala Glu Ser IleSer Glu Gly 1 5 10 15 Thr Ile Ala Gln Trp Leu Lys Gln Pro Gly Asp TyrVal Glu Gln Gly 20 25 30 Glu Tyr Leu Leu Glu Leu Glu Thr Asp Lys Val AsnVal Glu Leu Thr 35 40 45 Ala Glu Glu Ser Gly Val Leu Gln Glu Val Leu LysAsp Ser Gly Asp 50 55 60 Thr Val Gln Val Gly Glu Ile Ile Gly Thr Ile SerGlu Gly Ala Gly 65 70 75 80 Glu Ser Ser Ala Pro Ala Pro Thr Glu Lys ThrGlu Ser Lys Glu Ser 85 90 95 Val Lys Glu Glu Lys Gln Ala Glu Pro Ala AlaGln Glu Val Ser Glu 100 105 110 Glu Ala Gln Ser Glu Ala Lys Ser Arg ThrIle Ala Ser Pro Ser Ala 115 120 125 Arg Lys Leu Ala Arg Glu Lys Gly IleAsp Leu Ser Gln Val Pro Thr 130 135 140 Gly Asp Pro Leu Gly Arg Val ArgLys Gln Asp Val Glu Ala Tyr Glu 145 150 155 160 Lys Pro Ala Ser Lys ProAla Pro Gln Gln Lys Gln Gln Pro Gln Ala 165 170 175 Gln Lys Ala Gln GlnSer Phe Asp Lys Pro Val Glu Val Gln Lys Met 180 185 190 Ser Arg Arg ArgGln Thr Ile Ala Lys Arg Leu Val Glu Val Gln Gln 195 200 205 Thr Ser AlaMet Leu Thr Thr Phe Asn Glu Val Asp Met Thr Ala Val 210 215 220 Met AsnLeu Arg Lys Arg Arg Lys Asp Gln Phe Phe Glu Gln Asn Glu 225 230 235 240Val Lys Leu Gly Phe Met Ser Phe Phe Thr Lys Ala Val Val Ala Ala 245 250255 Leu Lys Lys Tyr Pro Leu Leu Asn Ala Glu Ile Gln Gly Asp Glu Leu 260265 270 Ile Val Lys Lys Phe Tyr Asp Ile Gly Ile Ala Val Ala Ala Val Glu275 280 285 Gly Leu Val Val Pro Val Val Arg Asp Ala Asp Arg Leu Thr PheAla 290 295 300 Gly Ile Glu Lys Glu Ile Gly Glu Leu Ala Lys Lys Ala ArgAsn Asn 305 310 315 320 Lys Leu Thr Leu Ser Glu Leu Glu Gly Gly Ser PheThr Ile Thr Asn 325 330 335 Gly Gly Thr Phe Gly Ser Leu Met Ser Thr ProIle Leu Asn Ser Pro 340 345 350 Gln Val Gly Ile Leu Gly Met His Lys IleGln Leu Arg Pro Val Ala 355 360 365 Ile Asp Glu Glu Arg Phe Glu Asn ArgPro Met Met Tyr Ile Ala Leu 370 375 380 Ser Tyr Asp His Arg Ile Val AspGly Lys Glu Ala Val Gly Phe Leu 385 390 395 400 Val Thr Ile Lys Asn LeuLeu Glu Asp Pro Glu Gln Leu Leu Leu Glu 405 410 415 Gly 1254 base pairsnucleic acid single linear other nucleic acid /desc = ”DNA (genomic)(p10b2)“ CDS 1..1251 41 ATG GCG GAA ATT AAG GTA CCT GAA TTA GCA GAA TCAATC TCA GAA GGA 48 Met Ala Glu Ile Lys Val Pro Glu Leu Ala Glu Ser IleSer Glu Gly 1 5 10 15 ACA ATA GCC CAA TGG TTA AAG CAG CCT GGT GAC TATGTA GAA CAG GGT 96 Thr Ile Ala Gln Trp Leu Lys Gln Pro Gly Asp Tyr ValGlu Gln Gly 20 25 30 GAA TAT CTG CTT GAA CTA GAA ACG GAT AAA GTG AAT GTTGAA TTG ACA 144 Glu Tyr Leu Leu Glu Leu Glu Thr Asp Lys Val Asn Val GluLeu Thr 35 40 45 GCA GAA GAA TCG GGT GTA CTT CAA GAG GTA TTG AAA GAT TCGGGT GAT 192 Ala Glu Glu Ser Gly Val Leu Gln Glu Val Leu Lys Asp Ser GlyAsp 50 55 60 ACC GTC CAG GTC GGA GAA ATT ATC GGT ACG ATT TCA GAA GGC GCGGGT 240 Thr Val Gln Val Gly Glu Ile Ile Gly Thr Ile Ser Glu Gly Ala Gly65 70 75 80 GAA AGT TCT GCC CCT GCT CCT ACA GAG AAA ACA GAA AGC AAG GAAAGC 288 Glu Ser Ser Ala Pro Ala Pro Thr Glu Lys Thr Glu Ser Lys Glu Ser85 90 95 GTA AAA GAA GAG AAA CAG GCT GAA CCA GCT GCA CAA GAG GTG AGC GAG336 Val Lys Glu Glu Lys Gln Ala Glu Pro Ala Ala Gln Glu Val Ser Glu 100105 110 GAA GCA CAA TCT GAA GCA AAA TCA AGA ACG ATC GCT TCT CCG TCG GCC384 Glu Ala Gln Ser Glu Ala Lys Ser Arg Thr Ile Ala Ser Pro Ser Ala 115120 125 CGT AAG CTT GCG AGA GAA AAA GGA ATT GAC CTG TCT CAA GTT CCA ACT432 Arg Lys Leu Ala Arg Glu Lys Gly Ile Asp Leu Ser Gln Val Pro Thr 130135 140 GGA GAT CCG CTT GGA AGA GTG CGC AAG CAG GAT GTC GAA GCG TAC GAA480 Gly Asp Pro Leu Gly Arg Val Arg Lys Gln Asp Val Glu Ala Tyr Glu 145150 155 160 AAA CCG GCA TCA AAA CCG GCT CCT CAG CAA AAG CAG CAG CCT CAGGCT 528 Lys Pro Ala Ser Lys Pro Ala Pro Gln Gln Lys Gln Gln Pro Gln Ala165 170 175 CAA AAA GCA CAG CAA AGC TTT GAC AAA CCT GTT GAA GTG CAA AAAATG 576 Gln Lys Ala Gln Gln Ser Phe Asp Lys Pro Val Glu Val Gln Lys Met180 185 190 TCA CGC CGC AGA CAA ACG ATT GCA AAA CGC CTT GTA GAG GTA CAGCAA 624 Ser Arg Arg Arg Gln Thr Ile Ala Lys Arg Leu Val Glu Val Gln Gln195 200 205 ACA TCA GCG ATG CTG ACT ACA TTT AAT GAA GTG GAC ATG ACG GCTGTC 672 Thr Ser Ala Met Leu Thr Thr Phe Asn Glu Val Asp Met Thr Ala Val210 215 220 ATG AAT CTC AGA AAA CGC CGC AAA GAT CAA TTT TTT GAG CAA AATGAA 720 Met Asn Leu Arg Lys Arg Arg Lys Asp Gln Phe Phe Glu Gln Asn Glu225 230 235 240 GTG AAG CTC GGC TTT ATG TCT TTC TTC ACG AAA GCG GTC GTGGCT GCA 768 Val Lys Leu Gly Phe Met Ser Phe Phe Thr Lys Ala Val Val AlaAla 245 250 255 TTG AAA AAA TAT CCG CTG TTG AAT GCA GAA ATT CAA GGC GATGAG TTG 816 Leu Lys Lys Tyr Pro Leu Leu Asn Ala Glu Ile Gln Gly Asp GluLeu 260 265 270 ATC GTT AAA AAA TTC TAC GAT ATC GGA ATC GCT GTT GCT GCTGTA GAA 864 Ile Val Lys Lys Phe Tyr Asp Ile Gly Ile Ala Val Ala Ala ValGlu 275 280 285 GGT CTT GTC GTT CCG GTT GTA CGG GAT GCG GAT CGC CTG ACATTT GCA 912 Gly Leu Val Val Pro Val Val Arg Asp Ala Asp Arg Leu Thr PheAla 290 295 300 GGA ATC GAA AAA GAG ATC GGC GAG CTT GCG AAA AAA GCA AGAAAC AAT 960 Gly Ile Glu Lys Glu Ile Gly Glu Leu Ala Lys Lys Ala Arg AsnAsn 305 310 315 320 AAA TTA ACC CTT AGC GAG CTT GAG GGA GGC TCC TTC ACGATT ACA AAC 1008 Lys Leu Thr Leu Ser Glu Leu Glu Gly Gly Ser Phe Thr IleThr Asn 325 330 335 GGA GGG ACT TTC GGT TCA TTG ATG TCA ACT CCA ATT TTAAAC AGC CCG 1056 Gly Gly Thr Phe Gly Ser Leu Met Ser Thr Pro Ile Leu AsnSer Pro 340 345 350 CAA GTC GGT ATA CTG GGC ATG CAT AAG ATT CAG CTG CGCCCT GTA GCC 1104 Gln Val Gly Ile Leu Gly Met His Lys Ile Gln Leu Arg ProVal Ala 355 360 365 ATT GAT GAA GAG CGT TTC GAA AAC CGT CCG ATG ATG TATATC GCT TTA 1152 Ile Asp Glu Glu Arg Phe Glu Asn Arg Pro Met Met Tyr IleAla Leu 370 375 380 TCT TAT GAT CAC CGA ATT GTA GAC GGT AAA GAA GCG GTTGGT TTC CTC 1200 Ser Tyr Asp His Arg Ile Val Asp Gly Lys Glu Ala Val GlyPhe Leu 385 390 395 400 GTG ACA ATC AAA AAT TTA CTG GAA GAT CCT GAA CAGCTT TTA TTA GAA 1248 Val Thr Ile Lys Asn Leu Leu Glu Asp Pro Glu Gln LeuLeu Leu Glu 405 410 415 GGA TAA 1254 Gly 417 amino acids amino acidlinear protein 42 Met Ala Glu Ile Lys Val Pro Glu Leu Ala Glu Ser IleSer Glu Gly 1 5 10 15 Thr Ile Ala Gln Trp Leu Lys Gln Pro Gly Asp TyrVal Glu Gln Gly 20 25 30 Glu Tyr Leu Leu Glu Leu Glu Thr Asp Lys Val AsnVal Glu Leu Thr 35 40 45 Ala Glu Glu Ser Gly Val Leu Gln Glu Val Leu LysAsp Ser Gly Asp 50 55 60 Thr Val Gln Val Gly Glu Ile Ile Gly Thr Ile SerGlu Gly Ala Gly 65 70 75 80 Glu Ser Ser Ala Pro Ala Pro Thr Glu Lys ThrGlu Ser Lys Glu Ser 85 90 95 Val Lys Glu Glu Lys Gln Ala Glu Pro Ala AlaGln Glu Val Ser Glu 100 105 110 Glu Ala Gln Ser Glu Ala Lys Ser Arg ThrIle Ala Ser Pro Ser Ala 115 120 125 Arg Lys Leu Ala Arg Glu Lys Gly IleAsp Leu Ser Gln Val Pro Thr 130 135 140 Gly Asp Pro Leu Gly Arg Val ArgLys Gln Asp Val Glu Ala Tyr Glu 145 150 155 160 Lys Pro Ala Ser Lys ProAla Pro Gln Gln Lys Gln Gln Pro Gln Ala 165 170 175 Gln Lys Ala Gln GlnSer Phe Asp Lys Pro Val Glu Val Gln Lys Met 180 185 190 Ser Arg Arg ArgGln Thr Ile Ala Lys Arg Leu Val Glu Val Gln Gln 195 200 205 Thr Ser AlaMet Leu Thr Thr Phe Asn Glu Val Asp Met Thr Ala Val 210 215 220 Met AsnLeu Arg Lys Arg Arg Lys Asp Gln Phe Phe Glu Gln Asn Glu 225 230 235 240Val Lys Leu Gly Phe Met Ser Phe Phe Thr Lys Ala Val Val Ala Ala 245 250255 Leu Lys Lys Tyr Pro Leu Leu Asn Ala Glu Ile Gln Gly Asp Glu Leu 260265 270 Ile Val Lys Lys Phe Tyr Asp Ile Gly Ile Ala Val Ala Ala Val Glu275 280 285 Gly Leu Val Val Pro Val Val Arg Asp Ala Asp Arg Leu Thr PheAla 290 295 300 Gly Ile Glu Lys Glu Ile Gly Glu Leu Ala Lys Lys Ala ArgAsn Asn 305 310 315 320 Lys Leu Thr Leu Ser Glu Leu Glu Gly Gly Ser PheThr Ile Thr Asn 325 330 335 Gly Gly Thr Phe Gly Ser Leu Met Ser Thr ProIle Leu Asn Ser Pro 340 345 350 Gln Val Gly Ile Leu Gly Met His Lys IleGln Leu Arg Pro Val Ala 355 360 365 Ile Asp Glu Glu Arg Phe Glu Asn ArgPro Met Met Tyr Ile Ala Leu 370 375 380 Ser Tyr Asp His Arg Ile Val AspGly Lys Glu Ala Val Gly Phe Leu 385 390 395 400 Val Thr Ile Lys Asn LeuLeu Glu Asp Pro Glu Gln Leu Leu Leu Glu 405 410 415 Gly 524 base pairsnucleic acid single linear other nucleic acid /desc = ”DNA (genomic)(p10c20)“ 43 TTTCGAAGAT CGGCTCTTAC AATATTGACG GTAAAGAACA GTTGGATTCTTAAAAACAAT 60 TAAAGAATTA ATTGAAAACC CAAAGACTTA TTATAGAATC TTAATCCACAACACAAAATA 120 GTTTACTATT CCTAAAAGCG GGATTAAATC AATAACAAAC AGCATAAGATTATTTCCTAG 180 TCGAAATATC TTACTGCTGT ACTTTATTTT TATAATGATC TTTAATGTGGGTTTAATTTT 240 GACTACTTAA AAATATATCA TTTCTATTGA AATAGACTCA CAATACAAATATAGTAGAAT 300 GCGTGTTTCA ATATGCTAAT GAATGCATTT TAGATATAAC AAACGAGAAATATATGAATT 360 CTATAAGCGC CTTTAGTTAA TTTAAATCTC TGAACATGAT GTAATTCGCTTTATGGACAC 420 CACATTATAT GTTTTCATGT CCTACAACAC AACACATATT CAATTGTATATACAGATATT 480 CTTAATGACA CACTCATAGC CCCATAACAA TATATGTTAC CTAT 524 2799base pairs nucleic acid single linear other nucleic acid /desc = ”DNA(genomic) (p12c32)“ CDS 1..2796 44 ATG ACT AAC GAA AGA AAA GAA GTT TCAGAG GCT CCT GTA AAC TTC GGT 48 Met Thr Asn Glu Arg Lys Glu Val Ser GluAla Pro Val Asn Phe Gly 1 5 10 15 GCG AAT TTA GGT CTA ATG TTA GAT CTATAT GAT GAC TTT TTA CAA GAT 96 Ala Asn Leu Gly Leu Met Leu Asp Leu TyrAsp Asp Phe Leu Gln Asp 20 25 30 CCA TCA TCT GTA CCA GAA GAT TTA CAA GTCTTA TTC AGC ACA ATT AAG 144 Pro Ser Ser Val Pro Glu Asp Leu Gln Val LeuPhe Ser Thr Ile Lys 35 40 45 AAT GAT GAC TCA ATT GTA CCA GCT TTA AAA AGTACA AGT AGT CAA AAT 192 Asn Asp Asp Ser Ile Val Pro Ala Leu Lys Ser ThrSer Ser Gln Asn 50 55 60 AGC GAC GGC ACA ATT AAG CGT GTC ATG CGT TTA ATTGAT AAT ATT CGC 240 Ser Asp Gly Thr Ile Lys Arg Val Met Arg Leu Ile AspAsn Ile Arg 65 70 75 80 CAA TAC GGG CAT CTT AAA GCC GAT ATT TAT CCT GTAAAT CCT CCA AAA 288 Gln Tyr Gly His Leu Lys Ala Asp Ile Tyr Pro Val AsnPro Pro Lys 85 90 95 AGG AAA CAT GTA CCT AAA TTA GAG ATT GAA GAC TTT GATTTA GAT CAA 336 Arg Lys His Val Pro Lys Leu Glu Ile Glu Asp Phe Asp LeuAsp Gln 100 105 110 CAG ACT TTG GAA GGT ATA TCA GCA GGA ATT GTT TCA GATCAC TTT GCC 384 Gln Thr Leu Glu Gly Ile Ser Ala Gly Ile Val Ser Asp HisPhe Ala 115 120 125 GAC ATT TAT GAT AAT GCT TAT GAA GCA ATT TTA AGA ATGGAA AAA CGT 432 Asp Ile Tyr Asp Asn Ala Tyr Glu Ala Ile Leu Arg Met GluLys Arg 130 135 140 TAC AAA GGA CCA ATT GCA TTT GAG TAT ACA CAT ATT AATAAC AAT ACC 480 Tyr Lys Gly Pro Ile Ala Phe Glu Tyr Thr His Ile Asn AsnAsn Thr 145 150 155 160 GAA CGT GGT TGG TTA AAA AGA AGA ATT GAA ACG CCATAT AAA GTA ACG 528 Glu Arg Gly Trp Leu Lys Arg Arg Ile Glu Thr Pro TyrLys Val Thr 165 170 175 TTA AAT AAT AAC GAA AAA AGG GCA CTA TTC AAA CAATTA GCG TAT GTT 576 Leu Asn Asn Asn Glu Lys Arg Ala Leu Phe Lys Gln LeuAla Tyr Val 180 185 190 GAA GGG TTT GAA AAA TAT CTT CAT AAA AAC TTC GTTGGT GCA AAG CGT 624 Glu Gly Phe Glu Lys Tyr Leu His Lys Asn Phe Val GlyAla Lys Arg 195 200 205 TTT TCA ATT GAA GGG GTA GAC GCA CTT GTA CCG ATGTTA CAA CGT ACT 672 Phe Ser Ile Glu Gly Val Asp Ala Leu Val Pro Met LeuGln Arg Thr 210 215 220 ATT ACG ATT GCT GCG AAA GAA GGT ATT AAA AAT ATACAA ATA GGC ATG 720 Ile Thr Ile Ala Ala Lys Glu Gly Ile Lys Asn Ile GlnIle Gly Met 225 230 235 240 GCT CAC CGT GGA CGT TTA AAC GTT TTA ACG CATGTC TTA GAA AAA CCG 768 Ala His Arg Gly Arg Leu Asn Val Leu Thr His ValLeu Glu Lys Pro 245 250 255 TAC GAA ATG ATG ATT TCA GAA TTT ATG CAT ACAGAT CCA ATG AAA TTC 816 Tyr Glu Met Met Ile Ser Glu Phe Met His Thr AspPro Met Lys Phe 260 265 270 TTA CCT GAA GAT GGT AGC TTG CAG TTA ACT GCTGGA TGG ACT GGT GAT 864 Leu Pro Glu Asp Gly Ser Leu Gln Leu Thr Ala GlyTrp Thr Gly Asp 275 280 285 GTG AAA TAT CAC CTT GGT GGC ATT AAA ACT ACTGAT TCA TAC GGT ACA 912 Val Lys Tyr His Leu Gly Gly Ile Lys Thr Thr AspSer Tyr Gly Thr 290 295 300 ATG CAG CGT ATT GCA CTG GCT AAC AAT CCA AGTCAC TTG GAA ATT GTT 960 Met Gln Arg Ile Ala Leu Ala Asn Asn Pro Ser HisLeu Glu Ile Val 305 310 315 320 GCA CCT GTT GTT GAG GGG CGT ACG AGA GCAGCA CAA GAT GAT ACA CAA 1008 Ala Pro Val Val Glu Gly Arg Thr Arg Ala AlaGln Asp Asp Thr Gln 325 330 335 CGA GCT GGG GCT CCG ACG ACT GAT CAT CATAAA GCA ATG CCA ATT ATT 1056 Arg Ala Gly Ala Pro Thr Thr Asp His His LysAla Met Pro Ile Ile 340 345 350 ATA CAT GGC GAT GCT GCT TAT CCT GGT CAAGGA ATT AAC TTC GAA ACA 1104 Ile His Gly Asp Ala Ala Tyr Pro Gly Gln GlyIle Asn Phe Glu Thr 355 360 365 ATG AAC TTA GGA AAC TTG AAA GGC TAT TCTACG GGT GGT TCA TTG CAT 1152 Met Asn Leu Gly Asn Leu Lys Gly Tyr Ser ThrGly Gly Ser Leu His 370 375 380 ATT ATT ACT AAC AAT AGA ATT GGA TTT ACTACA GAA CCA ATT GAT GCA 1200 Ile Ile Thr Asn Asn Arg Ile Gly Phe Thr ThrGlu Pro Ile Asp Ala 385 390 395 400 CGT TCA ACA ACT TAT TCT ACA GAT GTGGCC AAA GGT TAT GAT GTG CCA 1248 Arg Ser Thr Thr Tyr Ser Thr Asp Val AlaLys Gly Tyr Asp Val Pro 405 410 415 ATA TTC CAT GTC AAT GCA GAT GAC GTTGAA GCT ACT ATT GAA GCA ATT 1296 Ile Phe His Val Asn Ala Asp Asp Val GluAla Thr Ile Glu Ala Ile 420 425 430 GAT ATT GCA ATG GAA TTT AGA AAA GAGTTT CAT AAA GAC GTC GTT ATT 1344 Asp Ile Ala Met Glu Phe Arg Lys Glu PheHis Lys Asp Val Val Ile 435 440 445 GAT TTA GTA GGT TAT CGT CGT TTC GGACAT AAC GAA ATG GAT GAA CCA 1392 Asp Leu Val Gly Tyr Arg Arg Phe Gly HisAsn Glu Met Asp Glu Pro 450 455 460 TCA ATT ACT AAT CCA GTT CCT TAT CAGAAT ATT CGC AAA CAT GAC TCT 1440 Ser Ile Thr Asn Pro Val Pro Tyr Gln AsnIle Arg Lys His Asp Ser 465 470 475 480 GTT GAA TAT GTG TTT GGT AAA AAGCTT GTT AAT GAA GGT GTC ATT TCA 1488 Val Glu Tyr Val Phe Gly Lys Lys LeuVal Asn Glu Gly Val Ile Ser 485 490 495 GAA GAT GAA ATG CAT TCA TTT ATAGAA CAA GTC CAA AAG GAA CTA AGA 1536 Glu Asp Glu Met His Ser Phe Ile GluGln Val Gln Lys Glu Leu Arg 500 505 510 CAA GCT CAT GAT AAA ATT AAT AAAGCT GAT AAA ATG GAT AAT CCA GAT 1584 Gln Ala His Asp Lys Ile Asn Lys AlaAsp Lys Met Asp Asn Pro Asp 515 520 525 ATG GAA AAG CCT GCA GAT CTT GCATTA CCG TTA CAA GCA GAC GAA CAA 1632 Met Glu Lys Pro Ala Asp Leu Ala LeuPro Leu Gln Ala Asp Glu Gln 530 535 540 TCA TTT ACT TTT GAT CAC TTG AAAGAA ATA AAT GAT GCA TTG TTA ACA 1680 Ser Phe Thr Phe Asp His Leu Lys GluIle Asn Asp Ala Leu Leu Thr 545 550 555 560 TAT CCG GAT GGC TTT AAC ATTTTG AAA AAG TTA AAC AAA GTT CTT GAG 1728 Tyr Pro Asp Gly Phe Asn Ile LeuLys Lys Leu Asn Lys Val Leu Glu 565 570 575 AAG CGT CAT GAG CCG TTT AATAAA GAA GAT GGT TTA GTT GAT TGG GCA 1776 Lys Arg His Glu Pro Phe Asn LysGlu Asp Gly Leu Val Asp Trp Ala 580 585 590 CAA GCA GAA CAA CTT GCA TTTGCG ACA ATT TTA CAA GAT GGT ACA CCG 1824 Gln Ala Glu Gln Leu Ala Phe AlaThr Ile Leu Gln Asp Gly Thr Pro 595 600 605 ATT CGC TTA ACT GGT CAA GATAGT GAA CGT GGT ACA TTC AGT CAT AGG 1872 Ile Arg Leu Thr Gly Gln Asp SerGlu Arg Gly Thr Phe Ser His Arg 610 615 620 CAT GCC GTG TTA CAT GAT GAGCAA ACA GGT GAA ACA TAT ACA CCT TTA 1920 His Ala Val Leu His Asp Glu GlnThr Gly Glu Thr Tyr Thr Pro Leu 625 630 635 640 CAT CAT GTT CCT GAT CAAAAA GCG ACA TTT GAT ATA CAC AAT TCT CCG 1968 His His Val Pro Asp Gln LysAla Thr Phe Asp Ile His Asn Ser Pro 645 650 655 CTT TCA GAA GCA GCA GTAGTT GGT TTT GAA TAC GGC TAT AAT GTG GAA 2016 Leu Ser Glu Ala Ala Val ValGly Phe Glu Tyr Gly Tyr Asn Val Glu 660 665 670 AAC AAA AAA AGC TTC AATATT TGG GAA GCA CAA TAT GGT GAT TTT GCA 2064 Asn Lys Lys Ser Phe Asn IleTrp Glu Ala Gln Tyr Gly Asp Phe Ala 675 680 685 AAT ATG TCA CAA ATG ATTTTT GAC AAC TTC TTA TTC AGT TCT CGC TCA 2112 Asn Met Ser Gln Met Ile PheAsp Asn Phe Leu Phe Ser Ser Arg Ser 690 695 700 AAA TGG GGA GAA CGT TCAGGA TTA ACA TTA TTC TTA CCT CAT GCA TAT 2160 Lys Trp Gly Glu Arg Ser GlyLeu Thr Leu Phe Leu Pro His Ala Tyr 705 710 715 720 GAG GGT CAA GGG CCTGAA CAT TCA TCA GCA AGA TTA GAG CGA TTT TTA 2208 Glu Gly Gln Gly Pro GluHis Ser Ser Ala Arg Leu Glu Arg Phe Leu 725 730 735 CAA TTA GCT GCT GAAAAT AAT TGC ACA GTT GTC AAC TTA TCT AGT TCA 2256 Gln Leu Ala Ala Glu AsnAsn Cys Thr Val Val Asn Leu Ser Ser Ser 740 745 750 AGT AAT TAT TTC CACTTA TTG CGT GCA CAA GCG GCT AGT TTA GAT TCT 2304 Ser Asn Tyr Phe His LeuLeu Arg Ala Gln Ala Ala Ser Leu Asp Ser 755 760 765 GAA CAA ATG CGA CCATTG GTT GTT ATG TCA CCA AAA AGC TTA CTG AGA 2352 Glu Gln Met Arg Pro LeuVal Val Met Ser Pro Lys Ser Leu Leu Arg 770 775 780 AAT AAA ACA GTT GCAAAA CCA ATT GAT GAA TTT ACT TCT GGT GGA TTT 2400 Asn Lys Thr Val Ala LysPro Ile Asp Glu Phe Thr Ser Gly Gly Phe 785 790 795 800 GAG CCA ATT TTGACA GAA TCA TAT CAA GCG GAT AAG GTT ACA AAA GTT 2448 Glu Pro Ile Leu ThrGlu Ser Tyr Gln Ala Asp Lys Val Thr Lys Val 805 810 815 ATT TTG GCA ACTGGT AAA ATG TTC ATT GAT TTA AAA GAA GCA TTA GCT 2496 Ile Leu Ala Thr GlyLys Met Phe Ile Asp Leu Lys Glu Ala Leu Ala 820 825 830 AAA AAT CCA GACGAA TCA GTA TTA CTC GTT GCG ATT GAA AGA TTG TAT 2544 Lys Asn Pro Asp GluSer Val Leu Leu Val Ala Ile Glu Arg Leu Tyr 835 840 845 CCA TTC CCA GAGGAA GAG ATT GAA GCA TTA CTA GCA CAA TTG CCA AAC 2592 Pro Phe Pro Glu GluGlu Ile Glu Ala Leu Leu Ala Gln Leu Pro Asn 850 855 860 CTT GAA GAA GTGTCA TGG GTA CAA GAA GAA CCT AAA AAT CAA GGT GCA 2640 Leu Glu Glu Val SerTrp Val Gln Glu Glu Pro Lys Asn Gln Gly Ala 865 870 875 880 TGG TTA TATGTC TAT CCA TAT GTT AAA GTG CTA GTT GCA GAT AAA TAT 2688 Trp Leu Tyr ValTyr Pro Tyr Val Lys Val Leu Val Ala Asp Lys Tyr 885 890 895 GAT TTA AGTTAT CAT GGC AGA ATT CAA AGG GCT GCT CCA GCT GAA GGC 2736 Asp Leu Ser TyrHis Gly Arg Ile Gln Arg Ala Ala Pro Ala Glu Gly 900 905 910 GAT GGA GAAATT CAT AAA CTT GTT CAA AAT AAA ATT ATA GAA AAT GCA 2784 Asp Gly Glu IleHis Lys Leu Val Gln Asn Lys Ile Ile Glu Asn Ala 915 920 925 TTA AAA AATAAC TAG 2799 Leu Lys Asn Asn 930 932 amino acids amino acid linearprotein 45 Met Thr Asn Glu Arg Lys Glu Val Ser Glu Ala Pro Val Asn PheGly 1 5 10 15 Ala Asn Leu Gly Leu Met Leu Asp Leu Tyr Asp Asp Phe LeuGln Asp 20 25 30 Pro Ser Ser Val Pro Glu Asp Leu Gln Val Leu Phe Ser ThrIle Lys 35 40 45 Asn Asp Asp Ser Ile Val Pro Ala Leu Lys Ser Thr Ser SerGln Asn 50 55 60 Ser Asp Gly Thr Ile Lys Arg Val Met Arg Leu Ile Asp AsnIle Arg 65 70 75 80 Gln Tyr Gly His Leu Lys Ala Asp Ile Tyr Pro Val AsnPro Pro Lys 85 90 95 Arg Lys His Val Pro Lys Leu Glu Ile Glu Asp Phe AspLeu Asp Gln 100 105 110 Gln Thr Leu Glu Gly Ile Ser Ala Gly Ile Val SerAsp His Phe Ala 115 120 125 Asp Ile Tyr Asp Asn Ala Tyr Glu Ala Ile LeuArg Met Glu Lys Arg 130 135 140 Tyr Lys Gly Pro Ile Ala Phe Glu Tyr ThrHis Ile Asn Asn Asn Thr 145 150 155 160 Glu Arg Gly Trp Leu Lys Arg ArgIle Glu Thr Pro Tyr Lys Val Thr 165 170 175 Leu Asn Asn Asn Glu Lys ArgAla Leu Phe Lys Gln Leu Ala Tyr Val 180 185 190 Glu Gly Phe Glu Lys TyrLeu His Lys Asn Phe Val Gly Ala Lys Arg 195 200 205 Phe Ser Ile Glu GlyVal Asp Ala Leu Val Pro Met Leu Gln Arg Thr 210 215 220 Ile Thr Ile AlaAla Lys Glu Gly Ile Lys Asn Ile Gln Ile Gly Met 225 230 235 240 Ala HisArg Gly Arg Leu Asn Val Leu Thr His Val Leu Glu Lys Pro 245 250 255 TyrGlu Met Met Ile Ser Glu Phe Met His Thr Asp Pro Met Lys Phe 260 265 270Leu Pro Glu Asp Gly Ser Leu Gln Leu Thr Ala Gly Trp Thr Gly Asp 275 280285 Val Lys Tyr His Leu Gly Gly Ile Lys Thr Thr Asp Ser Tyr Gly Thr 290295 300 Met Gln Arg Ile Ala Leu Ala Asn Asn Pro Ser His Leu Glu Ile Val305 310 315 320 Ala Pro Val Val Glu Gly Arg Thr Arg Ala Ala Gln Asp AspThr Gln 325 330 335 Arg Ala Gly Ala Pro Thr Thr Asp His His Lys Ala MetPro Ile Ile 340 345 350 Ile His Gly Asp Ala Ala Tyr Pro Gly Gln Gly IleAsn Phe Glu Thr 355 360 365 Met Asn Leu Gly Asn Leu Lys Gly Tyr Ser ThrGly Gly Ser Leu His 370 375 380 Ile Ile Thr Asn Asn Arg Ile Gly Phe ThrThr Glu Pro Ile Asp Ala 385 390 395 400 Arg Ser Thr Thr Tyr Ser Thr AspVal Ala Lys Gly Tyr Asp Val Pro 405 410 415 Ile Phe His Val Asn Ala AspAsp Val Glu Ala Thr Ile Glu Ala Ile 420 425 430 Asp Ile Ala Met Glu PheArg Lys Glu Phe His Lys Asp Val Val Ile 435 440 445 Asp Leu Val Gly TyrArg Arg Phe Gly His Asn Glu Met Asp Glu Pro 450 455 460 Ser Ile Thr AsnPro Val Pro Tyr Gln Asn Ile Arg Lys His Asp Ser 465 470 475 480 Val GluTyr Val Phe Gly Lys Lys Leu Val Asn Glu Gly Val Ile Ser 485 490 495 GluAsp Glu Met His Ser Phe Ile Glu Gln Val Gln Lys Glu Leu Arg 500 505 510Gln Ala His Asp Lys Ile Asn Lys Ala Asp Lys Met Asp Asn Pro Asp 515 520525 Met Glu Lys Pro Ala Asp Leu Ala Leu Pro Leu Gln Ala Asp Glu Gln 530535 540 Ser Phe Thr Phe Asp His Leu Lys Glu Ile Asn Asp Ala Leu Leu Thr545 550 555 560 Tyr Pro Asp Gly Phe Asn Ile Leu Lys Lys Leu Asn Lys ValLeu Glu 565 570 575 Lys Arg His Glu Pro Phe Asn Lys Glu Asp Gly Leu ValAsp Trp Ala 580 585 590 Gln Ala Glu Gln Leu Ala Phe Ala Thr Ile Leu GlnAsp Gly Thr Pro 595 600 605 Ile Arg Leu Thr Gly Gln Asp Ser Glu Arg GlyThr Phe Ser His Arg 610 615 620 His Ala Val Leu His Asp Glu Gln Thr GlyGlu Thr Tyr Thr Pro Leu 625 630 635 640 His His Val Pro Asp Gln Lys AlaThr Phe Asp Ile His Asn Ser Pro 645 650 655 Leu Ser Glu Ala Ala Val ValGly Phe Glu Tyr Gly Tyr Asn Val Glu 660 665 670 Asn Lys Lys Ser Phe AsnIle Trp Glu Ala Gln Tyr Gly Asp Phe Ala 675 680 685 Asn Met Ser Gln MetIle Phe Asp Asn Phe Leu Phe Ser Ser Arg Ser 690 695 700 Lys Trp Gly GluArg Ser Gly Leu Thr Leu Phe Leu Pro His Ala Tyr 705 710 715 720 Glu GlyGln Gly Pro Glu His Ser Ser Ala Arg Leu Glu Arg Phe Leu 725 730 735 GlnLeu Ala Ala Glu Asn Asn Cys Thr Val Val Asn Leu Ser Ser Ser 740 745 750Ser Asn Tyr Phe His Leu Leu Arg Ala Gln Ala Ala Ser Leu Asp Ser 755 760765 Glu Gln Met Arg Pro Leu Val Val Met Ser Pro Lys Ser Leu Leu Arg 770775 780 Asn Lys Thr Val Ala Lys Pro Ile Asp Glu Phe Thr Ser Gly Gly Phe785 790 795 800 Glu Pro Ile Leu Thr Glu Ser Tyr Gln Ala Asp Lys Val ThrLys Val 805 810 815 Ile Leu Ala Thr Gly Lys Met Phe Ile Asp Leu Lys GluAla Leu Ala 820 825 830 Lys Asn Pro Asp Glu Ser Val Leu Leu Val Ala IleGlu Arg Leu Tyr 835 840 845 Pro Phe Pro Glu Glu Glu Ile Glu Ala Leu LeuAla Gln Leu Pro Asn 850 855 860 Leu Glu Glu Val Ser Trp Val Gln Glu GluPro Lys Asn Gln Gly Ala 865 870 875 880 Trp Leu Tyr Val Tyr Pro Tyr ValLys Val Leu Val Ala Asp Lys Tyr 885 890 895 Asp Leu Ser Tyr His Gly ArgIle Gln Arg Ala Ala Pro Ala Glu Gly 900 905 910 Asp Gly Glu Ile His LysLeu Val Gln Asn Lys Ile Ile Glu Asn Ala 915 920 925 Leu Lys Asn Asn 9302574 base pairs nucleic acid single linear other nucleic acid /desc =”DNA (genomic) (p10b30)“ CDS 1..2571 46 ATG CGT CTG GAT CGT CTT ACT AATAAA TTC CAG CTT GCT CTT GCC GAT 48 Met Arg Leu Asp Arg Leu Thr Asn LysPhe Gln Leu Ala Leu Ala Asp 1 5 10 15 GCC CAA TCA CTT GCA CTC GGG CACGAC AAC CAA TTT ATC GAA CCA CTT 96 Ala Gln Ser Leu Ala Leu Gly His AspAsn Gln Phe Ile Glu Pro Leu 20 25 30 CAT TTA ATG AGC GCC CTG CTG AAT CAGGAA GGG GGT TCG GTT AGT CCT 144 His Leu Met Ser Ala Leu Leu Asn Gln GluGly Gly Ser Val Ser Pro 35 40 45 TTA TTA ACA TCC GCT GGC ATA AAT GCT GGCCAG TTG CGC ACA GAT ATC 192 Leu Leu Thr Ser Ala Gly Ile Asn Ala Gly GlnLeu Arg Thr Asp Ile 50 55 60 AAT CAG GCA TTA AAT CGT TTA CCG CAG GTT GAAGGT ACT GGT GGT GAT 240 Asn Gln Ala Leu Asn Arg Leu Pro Gln Val Glu GlyThr Gly Gly Asp 65 70 75 80 GTC CAG CCA TCA CAG GAT CTG GTG CGC GTT CTTAAT CTT TGC GAC AAC 288 Val Gln Pro Ser Gln Asp Leu Val Arg Val Leu AsnLeu Cys Asp Asn 85 90 95 GTG GCG CAA AAA CGT GGT GAT AAC TTT ATC TCG TCAGAA CTG TTC GTT 336 Val Ala Gln Lys Arg Gly Asp Asn Phe Ile Ser Ser GluLeu Phe Val 100 105 110 CTG GCG GCA CTT GAG TCT CGC GGC ACC GTG GCC GACATC CTG AAA GCA 384 Leu Ala Ala Leu Glu Ser Arg Gly Thr Val Ala Asp IleLeu Lys Ala 115 120 125 GCA GGG GCG ACC ACC GCC AAC ATT ACT CAA GCG ATTGAA CAA ATG CGT 432 Ala Gly Ala Thr Thr Ala Asn Ile Thr Gln Ala Ile GluGln Met Arg 130 135 140 GGA GGT GAA AGC GTG AAC GAT CAA GGT GCT GAA GACCAA CGT CAG GCT 480 Gly Gly Glu Ser Val Asn Asp Gln Gly Ala Glu Asp GlnArg Gln Ala 145 150 155 160 TTG AAA AAA TAT ACC ATC GAC CTT ACC GAA CGAGCC GAA CAG GGC AAA 528 Leu Lys Lys Tyr Thr Ile Asp Leu Thr Glu Arg AlaGlu Gln Gly Lys 165 170 175 CTC GAT CCG GTG ATT GGT CGT GAT GAA GAA ATTCGC CGT ACC ATT CAG 576 Leu Asp Pro Val Ile Gly Arg Asp Glu Glu Ile ArgArg Thr Ile Gln 180 185 190 GTG CTG CAA CGT CGT ACT AAA AAT AAC CCG GTACTG ATT GGT GAA CCC 624 Val Leu Gln Arg Arg Thr Lys Asn Asn Pro Val LeuIle Gly Glu Pro 195 200 205 GGC GTC GGT AAA ACT GCC ATC GTT GAA GGT CTGGCG CAG CGT ATT ATC 672 Gly Val Gly Lys Thr Ala Ile Val Glu Gly Leu AlaGln Arg Ile Ile 210 215 220 AAC GGC GAA GTG CCG GAA GGG TTG AAA GGC CGCCGG GTA CTG GCG CTG 720 Asn Gly Glu Val Pro Glu Gly Leu Lys Gly Arg ArgVal Leu Ala Leu 225 230 235 240 GAT ATG GGC GCG CTG GTG GCT GGG GCG AAATAT CGC GGT GAG TTT GAA 768 Asp Met Gly Ala Leu Val Ala Gly Ala Lys TyrArg Gly Glu Phe Glu 245 250 255 GAA CGT TTA AAA GGC GTG CTT AAC GAT CTTGCC AAA CAG GAA GGC AAC 816 Glu Arg Leu Lys Gly Val Leu Asn Asp Leu AlaLys Gln Glu Gly Asn 260 265 270 GTC ATC CTA TTT ATC GAC GAA TTA CAT ACCATG GTC GGC GCG GGT AAA 864 Val Ile Leu Phe Ile Asp Glu Leu His Thr MetVal Gly Ala Gly Lys 275 280 285 GCC GAT GGC GCA ATG GAC GCC GGA AAC ATGCTG AAA CCG GCG CTG GCG 912 Ala Asp Gly Ala Met Asp Ala Gly Asn Met LeuLys Pro Ala Leu Ala 290 295 300 CGT GGT GAA TTG CAC TGC GTA GGT GCC ACGACG CTT GAC GAA TAT CGC 960 Arg Gly Glu Leu His Cys Val Gly Ala Thr ThrLeu Asp Glu Tyr Arg 305 310 315 320 CAG TAC ATT GAA AAA GAT GCT GCG CTGGAA CGT CGT TTC CAG AAA GTG 1008 Gln Tyr Ile Glu Lys Asp Ala Ala Leu GluArg Arg Phe Gln Lys Val 325 330 335 TTT GTT GCC GAG CCT TCT GTT GAA GATACC ATT GCG ATT CTG CGT GGC 1056 Phe Val Ala Glu Pro Ser Val Glu Asp ThrIle Ala Ile Leu Arg Gly 340 345 350 CTG AAA GAA CGT TAC GAA TTG CAC CACCAT GTG CAA ATT ACT GAC CCG 1104 Leu Lys Glu Arg Tyr Glu Leu His His HisVal Gln Ile Thr Asp Pro 355 360 365 GCA ATT GTT GCA GCG GCG ACG TTG TCTCAT CGC TAC ATT GCT GAC CGT 1152 Ala Ile Val Ala Ala Ala Thr Leu Ser HisArg Tyr Ile Ala Asp Arg 370 375 380 CAG CTG CCG GAT AAA GCC ATC GAC CTGATC GAT GAA GCA GCA TCC AGC 1200 Gln Leu Pro Asp Lys Ala Ile Asp Leu IleAsp Glu Ala Ala Ser Ser 385 390 395 400 ATT CGT ATG CAG ATT GAC TCA AAACCA GAA GAA CTC GAC CGA CTC GAT 1248 Ile Arg Met Gln Ile Asp Ser Lys ProGlu Glu Leu Asp Arg Leu Asp 405 410 415 CGT CGT ATC ATC CAG CTC AAA CTGGAA CAA CAG GCG TTA ATG AAA GAG 1296 Arg Arg Ile Ile Gln Leu Lys Leu GluGln Gln Ala Leu Met Lys Glu 420 425 430 TCT GAT GAA GCC AGT AAA AAA CGTCTG GAT ATG CTC AAC GAA GAA CTG 1344 Ser Asp Glu Ala Ser Lys Lys Arg LeuAsp Met Leu Asn Glu Glu Leu 435 440 445 AGC GAC AAA GAA CGT CAG TAC TCCGAG TTA GAA GAA GAG TGG AAA GCA 1392 Ser Asp Lys Glu Arg Gln Tyr Ser GluLeu Glu Glu Glu Trp Lys Ala 450 455 460 GAG AAG GCA TCG CTT TCT GGT ACGCAG ACC ATT AAA GCG GAA CTG GAA 1440 Glu Lys Ala Ser Leu Ser Gly Thr GlnThr Ile Lys Ala Glu Leu Glu 465 470 475 480 CAG GCG AAA ATC GCT ATT GAACAG GCT CGC CGT GTG GGG GAC CTG GCG 1488 Gln Ala Lys Ile Ala Ile Glu GlnAla Arg Arg Val Gly Asp Leu Ala 485 490 495 CGG ATG TCT GAA CTG CAA TACGGC AAA ATC CCG GAA CTG GAA AAG CAA 1536 Arg Met Ser Glu Leu Gln Tyr GlyLys Ile Pro Glu Leu Glu Lys Gln 500 505 510 CTG GAA GCC GCA ACG CAG CTCGAA GGC AAA ACT ATG CGT CTG TTG CGT 1584 Leu Glu Ala Ala Thr Gln Leu GluGly Lys Thr Met Arg Leu Leu Arg 515 520 525 AAT AAA GTG ACC GAC GCC GAAATT GCT GAA GTG CTG GCG CGT TGG ACG 1632 Asn Lys Val Thr Asp Ala Glu IleAla Glu Val Leu Ala Arg Trp Thr 530 535 540 GGG ATT CCG GTT TCT CGC ATGATG GAA AGC GAG CGC GAA AAA CTG CTG 1680 Gly Ile Pro Val Ser Arg Met MetGlu Ser Glu Arg Glu Lys Leu Leu 545 550 555 560 CGT ATG GAG CAA GAA CTGCAC CAT CGC GTA ATT GGT CAG AAC GAA GCG 1728 Arg Met Glu Gln Glu Leu HisHis Arg Val Ile Gly Gln Asn Glu Ala 565 570 575 GTT GAT GCG GTA TCT AACGCT ATT CGT CGT AGC CGT GCG GGG CTG GCG 1776 Val Asp Ala Val Ser Asn AlaIle Arg Arg Ser Arg Ala Gly Leu Ala 580 585 590 GAT CCA AAT CGC CCG ATTGGT TCA TTC CTG TTC CTC GGC CCA ACT GGT 1824 Asp Pro Asn Arg Pro Ile GlySer Phe Leu Phe Leu Gly Pro Thr Gly 595 600 605 GTG GGG AAA ACA GAG CTTTGT AAG GCG CTG GCG AAC TTT ATG TTT GAT 1872 Val Gly Lys Thr Glu Leu CysLys Ala Leu Ala Asn Phe Met Phe Asp 610 615 620 AGC GAC GAG GCG ATG GTCCGT ATC GAT ATG TCC GAG TTT ATG GAG AAA 1920 Ser Asp Glu Ala Met Val ArgIle Asp Met Ser Glu Phe Met Glu Lys 625 630 635 640 CAC TCG GTG TCT CGTTTG GTT GGT GCG CCT CCG GGA TAT GTC GGT TAT 1968 His Ser Val Ser Arg LeuVal Gly Ala Pro Pro Gly Tyr Val Gly Tyr 645 650 655 GAA GAA GGT GGC TACCTG ACC GAA GCG GTG CGT CGT CGT CCG TAT TCC 2016 Glu Glu Gly Gly Tyr LeuThr Glu Ala Val Arg Arg Arg Pro Tyr Ser 660 665 670 GTC ATC CTG CTG GATGAA GTG GAA AAA GCG CAT CCG GAT GTC TTC AAC 2064 Val Ile Leu Leu Asp GluVal Glu Lys Ala His Pro Asp Val Phe Asn 675 680 685 ATT CTG TTG CAG GTACTG GAT GAT GGG CGT CTG ACT GAC GGG CAA GGG 2112 Ile Leu Leu Gln Val LeuAsp Asp Gly Arg Leu Thr Asp Gly Gln Gly 690 695 700 AGA ACG GTC GAC TTCCGT AAT ACG GTC GTC ATT ATG ACC TCT AAC CTC 2160 Arg Thr Val Asp Phe ArgAsn Thr Val Val Ile Met Thr Ser Asn Leu 705 710 715 720 GGT TCC GAT CTGATT CAG GAA CGC TTC GGT GAA CTG GAT TAT GCG CAC 2208 Gly Ser Asp Leu IleGln Glu Arg Phe Gly Glu Leu Asp Tyr Ala His 725 730 735 ATG AAA GAG CTGGTG CTC GGT GTG GTA AGC CAT AAC TTC CGT CCG GAA 2256 Met Lys Glu Leu ValLeu Gly Val Val Ser His Asn Phe Arg Pro Glu 740 745 750 TTC ATT AAC CGTATC GAT GAA GTG GTG GTC TTC CAT CCG CTG GGT GAA 2304 Phe Ile Asn Arg IleAsp Glu Val Val Val Phe His Pro Leu Gly Glu 755 760 765 CAG CAC ATT GCCTCG ATT GCG CAG ATT CAG TTG AAA CGT CTG TAC AAA 2352 Gln His Ile Ala SerIle Ala Gln Ile Gln Leu Lys Arg Leu Tyr Lys 770 775 780 CGT CTG GAA GAACGT GGT TAT GAA ATC CAC ATT TCT GAC GAG GCG CTG 2400 Arg Leu Glu Glu ArgGly Tyr Glu Ile His Ile Ser Asp Glu Ala Leu 785 790 795 800 AAA CTG CTGAGC GAG AAC GGT TAC GAT CCG GTC TAT GGT GCA CGT CCT 2448 Lys Leu Leu SerGlu Asn Gly Tyr Asp Pro Val Tyr Gly Ala Arg Pro 805 810 815 CTG AAA CGTGCA ATT CAG CAG CAG ATC GAA AAC CCG CTG GCA CAG CAA 2496 Leu Lys Arg AlaIle Gln Gln Gln Ile Glu Asn Pro Leu Ala Gln Gln 820 825 830 ATA CTG TCTGGT GAA TTG GTT CCG GGT AAA GTG ATT CGC CTG GAA GTT 2544 Ile Leu Ser GlyGlu Leu Val Pro Gly Lys Val Ile Arg Leu Glu Val 835 840 845 AAT GAA GACCGG ATT GTC GCC GTC CAG TAA 2574 Asn Glu Asp Arg Ile Val Ala Val Gln 850855 857 amino acids amino acid linear protein 47 Met Arg Leu Asp Arg LeuThr Asn Lys Phe Gln Leu Ala Leu Ala Asp 1 5 10 15 Ala Gln Ser Leu AlaLeu Gly His Asp Asn Gln Phe Ile Glu Pro Leu 20 25 30 His Leu Met Ser AlaLeu Leu Asn Gln Glu Gly Gly Ser Val Ser Pro 35 40 45 Leu Leu Thr Ser AlaGly Ile Asn Ala Gly Gln Leu Arg Thr Asp Ile 50 55 60 Asn Gln Ala Leu AsnArg Leu Pro Gln Val Glu Gly Thr Gly Gly Asp 65 70 75 80 Val Gln Pro SerGln Asp Leu Val Arg Val Leu Asn Leu Cys Asp Asn 85 90 95 Val Ala Gln LysArg Gly Asp Asn Phe Ile Ser Ser Glu Leu Phe Val 100 105 110 Leu Ala AlaLeu Glu Ser Arg Gly Thr Val Ala Asp Ile Leu Lys Ala 115 120 125 Ala GlyAla Thr Thr Ala Asn Ile Thr Gln Ala Ile Glu Gln Met Arg 130 135 140 GlyGly Glu Ser Val Asn Asp Gln Gly Ala Glu Asp Gln Arg Gln Ala 145 150 155160 Leu Lys Lys Tyr Thr Ile Asp Leu Thr Glu Arg Ala Glu Gln Gly Lys 165170 175 Leu Asp Pro Val Ile Gly Arg Asp Glu Glu Ile Arg Arg Thr Ile Gln180 185 190 Val Leu Gln Arg Arg Thr Lys Asn Asn Pro Val Leu Ile Gly GluPro 195 200 205 Gly Val Gly Lys Thr Ala Ile Val Glu Gly Leu Ala Gln ArgIle Ile 210 215 220 Asn Gly Glu Val Pro Glu Gly Leu Lys Gly Arg Arg ValLeu Ala Leu 225 230 235 240 Asp Met Gly Ala Leu Val Ala Gly Ala Lys TyrArg Gly Glu Phe Glu 245 250 255 Glu Arg Leu Lys Gly Val Leu Asn Asp LeuAla Lys Gln Glu Gly Asn 260 265 270 Val Ile Leu Phe Ile Asp Glu Leu HisThr Met Val Gly Ala Gly Lys 275 280 285 Ala Asp Gly Ala Met Asp Ala GlyAsn Met Leu Lys Pro Ala Leu Ala 290 295 300 Arg Gly Glu Leu His Cys ValGly Ala Thr Thr Leu Asp Glu Tyr Arg 305 310 315 320 Gln Tyr Ile Glu LysAsp Ala Ala Leu Glu Arg Arg Phe Gln Lys Val 325 330 335 Phe Val Ala GluPro Ser Val Glu Asp Thr Ile Ala Ile Leu Arg Gly 340 345 350 Leu Lys GluArg Tyr Glu Leu His His His Val Gln Ile Thr Asp Pro 355 360 365 Ala IleVal Ala Ala Ala Thr Leu Ser His Arg Tyr Ile Ala Asp Arg 370 375 380 GlnLeu Pro Asp Lys Ala Ile Asp Leu Ile Asp Glu Ala Ala Ser Ser 385 390 395400 Ile Arg Met Gln Ile Asp Ser Lys Pro Glu Glu Leu Asp Arg Leu Asp 405410 415 Arg Arg Ile Ile Gln Leu Lys Leu Glu Gln Gln Ala Leu Met Lys Glu420 425 430 Ser Asp Glu Ala Ser Lys Lys Arg Leu Asp Met Leu Asn Glu GluLeu 435 440 445 Ser Asp Lys Glu Arg Gln Tyr Ser Glu Leu Glu Glu Glu TrpLys Ala 450 455 460 Glu Lys Ala Ser Leu Ser Gly Thr Gln Thr Ile Lys AlaGlu Leu Glu 465 470 475 480 Gln Ala Lys Ile Ala Ile Glu Gln Ala Arg ArgVal Gly Asp Leu Ala 485 490 495 Arg Met Ser Glu Leu Gln Tyr Gly Lys IlePro Glu Leu Glu Lys Gln 500 505 510 Leu Glu Ala Ala Thr Gln Leu Glu GlyLys Thr Met Arg Leu Leu Arg 515 520 525 Asn Lys Val Thr Asp Ala Glu IleAla Glu Val Leu Ala Arg Trp Thr 530 535 540 Gly Ile Pro Val Ser Arg MetMet Glu Ser Glu Arg Glu Lys Leu Leu 545 550 555 560 Arg Met Glu Gln GluLeu His His Arg Val Ile Gly Gln Asn Glu Ala 565 570 575 Val Asp Ala ValSer Asn Ala Ile Arg Arg Ser Arg Ala Gly Leu Ala 580 585 590 Asp Pro AsnArg Pro Ile Gly Ser Phe Leu Phe Leu Gly Pro Thr Gly 595 600 605 Val GlyLys Thr Glu Leu Cys Lys Ala Leu Ala Asn Phe Met Phe Asp 610 615 620 SerAsp Glu Ala Met Val Arg Ile Asp Met Ser Glu Phe Met Glu Lys 625 630 635640 His Ser Val Ser Arg Leu Val Gly Ala Pro Pro Gly Tyr Val Gly Tyr 645650 655 Glu Glu Gly Gly Tyr Leu Thr Glu Ala Val Arg Arg Arg Pro Tyr Ser660 665 670 Val Ile Leu Leu Asp Glu Val Glu Lys Ala His Pro Asp Val PheAsn 675 680 685 Ile Leu Leu Gln Val Leu Asp Asp Gly Arg Leu Thr Asp GlyGln Gly 690 695 700 Arg Thr Val Asp Phe Arg Asn Thr Val Val Ile Met ThrSer Asn Leu 705 710 715 720 Gly Ser Asp Leu Ile Gln Glu Arg Phe Gly GluLeu Asp Tyr Ala His 725 730 735 Met Lys Glu Leu Val Leu Gly Val Val SerHis Asn Phe Arg Pro Glu 740 745 750 Phe Ile Asn Arg Ile Asp Glu Val ValVal Phe His Pro Leu Gly Glu 755 760 765 Gln His Ile Ala Ser Ile Ala GlnIle Gln Leu Lys Arg Leu Tyr Lys 770 775 780 Arg Leu Glu Glu Arg Gly TyrGlu Ile His Ile Ser Asp Glu Ala Leu 785 790 795 800 Lys Leu Leu Ser GluAsn Gly Tyr Asp Pro Val Tyr Gly Ala Arg Pro 805 810 815 Leu Lys Arg AlaIle Gln Gln Gln Ile Glu Asn Pro Leu Ala Gln Gln 820 825 830 Ile Leu SerGly Glu Leu Val Pro Gly Lys Val Ile Arg Leu Glu Val 835 840 845 Asn GluAsp Arg Ile Val Ala Val Gln 850 855 1239 base pairs nucleic acid singlelinear other nucleic acid /desc = ”DNA (genomic) (p13c3)“ CDS 1..1236 48ATG ATG AAA GAA AAA GTG ATT TTT CTC GTT GAC ATG CAA TCG TTT TAT 48 MetMet Lys Glu Lys Val Ile Phe Leu Val Asp Met Gln Ser Phe Tyr 1 5 10 15GCA TCT GTA GAG AAA GCG GAA AAT CCA CAT TTG AAA AAT AGG CCC GTC 96 AlaSer Val Glu Lys Ala Glu Asn Pro His Leu Lys Asn Arg Pro Val 20 25 30 ATTGTT TCG GGT GAC CCT GAA AAA AGG GGC GGA GTC GTA TTG GCT GCC 144 Ile ValSer Gly Asp Pro Glu Lys Arg Gly Gly Val Val Leu Ala Ala 35 40 45 TGC CCGCTG GCG AAA CAA AAG GGT GTG GTG AAT GCT TCA CGG CTG TGG 192 Cys Pro LeuAla Lys Gln Lys Gly Val Val Asn Ala Ser Arg Leu Trp 50 55 60 GAG GCG CAGGAA AAG TGT CCT GAG GCT GTT GTG CTC CGG CCG CGT ATG 240 Glu Ala Gln GluLys Cys Pro Glu Ala Val Val Leu Arg Pro Arg Met 65 70 75 80 CAG CGG TATATT GAT GTA TCA CTG CAA ATT ACG GCC ATT CTC GAG GAG 288 Gln Arg Tyr IleAsp Val Ser Leu Gln Ile Thr Ala Ile Leu Glu Glu 85 90 95 TAT ACA GAC CTTGTG GAG CCG TAT TCC ATC GAT GAA CAG TTC ATG GAC 336 Tyr Thr Asp Leu ValGlu Pro Tyr Ser Ile Asp Glu Gln Phe Met Asp 100 105 110 ATT ACA GGC AGCCAG AAG CTG TTT GGG ACG CCG ATG GAG ATC GCG AAA 384 Ile Thr Gly Ser GlnLys Leu Phe Gly Thr Pro Met Glu Ile Ala Lys 115 120 125 AGC ATT CAG GGCAGA ATC ATG CGG GAG ATC GGC GTT TAT GCA CGG GTC 432 Ser Ile Gln Gly ArgIle Met Arg Glu Ile Gly Val Tyr Ala Arg Val 130 135 140 GGA ATC GGC CCTAAC AAA GCG CTG GCC AAA ATT GCG TGT GAC AAT TTT 480 Gly Ile Gly Pro AsnLys Ala Leu Ala Lys Ile Ala Cys Asp Asn Phe 145 150 155 160 GCC AAA AAGAAT AAG AAC GGT ATT TTT ACC TTA ACG AAA GAA AAT ATG 528 Ala Lys Lys AsnLys Asn Gly Ile Phe Thr Leu Thr Lys Glu Asn Met 165 170 175 AAA ACC GAAATG TGG CCG CTC CCG GTG GGC AGC ATG TTT GGC GTC GGG 576 Lys Thr Glu MetTrp Pro Leu Pro Val Gly Ser Met Phe Gly Val Gly 180 185 190 AGC CGC ATGAAG CAT CAT TTA AAT CGA ATG GGC ATC AGC ACG ATC GGC 624 Ser Arg Met LysHis His Leu Asn Arg Met Gly Ile Ser Thr Ile Gly 195 200 205 GGG CTC GCGGCT TTT CCG CTC GAT CTT TTA AAA AAG AAA TGG GGC ATT 672 Gly Leu Ala AlaPhe Pro Leu Asp Leu Leu Lys Lys Lys Trp Gly Ile 210 215 220 AAC GGC CACGTG CTG TGG ATG ACG GCA AAC GGA ATC GAC TAT TCC CCT 720 Asn Gly His ValLeu Trp Met Thr Ala Asn Gly Ile Asp Tyr Ser Pro 225 230 235 240 GTG TCAACT TCG TCT CTG GAC GGG CAA AAG GCG ATA GGT CAT GGA ATG 768 Val Ser ThrSer Ser Leu Asp Gly Gln Lys Ala Ile Gly His Gly Met 245 250 255 ACT CTCCCG AGA GAC TAC GAA CAC TTT GAC AAA GAA ATC AAG GTC GTA 816 Thr Leu ProArg Asp Tyr Glu His Phe Asp Lys Glu Ile Lys Val Val 260 265 270 TTG CTTGAG CTG AGT GAA GAG GTG TGC AGG CGA AGC CGA AAC GCC GGG 864 Leu Leu GluLeu Ser Glu Glu Val Cys Arg Arg Ser Arg Asn Ala Gly 275 280 285 GTC ATGGGG CAG ACA GTG TCA GTG AGC TGC CGG GGT GCT GAT TTT GAT 912 Val Met GlyGln Thr Val Ser Val Ser Cys Arg Gly Ala Asp Phe Asp 290 295 300 TGG CCGACG GGC TTC AAC CGG CAA GTG AAG CTG GCA GAG CCG ACT AAT 960 Trp Pro ThrGly Phe Asn Arg Gln Val Lys Leu Ala Glu Pro Thr Asn 305 310 315 320 TCTACG CAG GAT GTA TAT GAG GCT GTA CGA CGG CTG TTT CTT ACA TTT 1008 Ser ThrGln Asp Val Tyr Glu Ala Val Arg Arg Leu Phe Leu Thr Phe 325 330 335 TGGGAC GGG AAA CCC GTC CGC CGC CTC GGT GTC AAT CTG TCT CAG CTC 1056 Trp AspGly Lys Pro Val Arg Arg Leu Gly Val Asn Leu Ser Gln Leu 340 345 350 TCATCT GAT GAC ATA TGG CAG CTC AAT TTA TTT CAG GAT TAT GCA AAG 1104 Ser SerAsp Asp Ile Trp Gln Leu Asn Leu Phe Gln Asp Tyr Ala Lys 355 360 365 AAAATG AGC CTA GGC TAT GTG ATG GAT GGC ATT AAA AAT CGA TTC GGC 1152 Lys MetSer Leu Gly Tyr Val Met Asp Gly Ile Lys Asn Arg Phe Gly 370 375 380 GATACA GCA ATC ATC AGG GCG GCG TCA CTG ACA GCG GCA GGC CAG GCA 1200 Asp ThrAla Ile Ile Arg Ala Ala Ser Leu Thr Ala Ala Gly Gln Ala 385 390 395 400TTT GAA CGT GCG GCT AAA ATA GGG GGG CAT TAT AAA TGA 1239 Phe Glu Arg AlaAla Lys Ile Gly Gly His Tyr Lys 405 410 412 amino acids amino acidlinear protein 49 Met Met Lys Glu Lys Val Ile Phe Leu Val Asp Met GlnSer Phe Tyr 1 5 10 15 Ala Ser Val Glu Lys Ala Glu Asn Pro His Leu LysAsn Arg Pro Val 20 25 30 Ile Val Ser Gly Asp Pro Glu Lys Arg Gly Gly ValVal Leu Ala Ala 35 40 45 Cys Pro Leu Ala Lys Gln Lys Gly Val Val Asn AlaSer Arg Leu Trp 50 55 60 Glu Ala Gln Glu Lys Cys Pro Glu Ala Val Val LeuArg Pro Arg Met 65 70 75 80 Gln Arg Tyr Ile Asp Val Ser Leu Gln Ile ThrAla Ile Leu Glu Glu 85 90 95 Tyr Thr Asp Leu Val Glu Pro Tyr Ser Ile AspGlu Gln Phe Met Asp 100 105 110 Ile Thr Gly Ser Gln Lys Leu Phe Gly ThrPro Met Glu Ile Ala Lys 115 120 125 Ser Ile Gln Gly Arg Ile Met Arg GluIle Gly Val Tyr Ala Arg Val 130 135 140 Gly Ile Gly Pro Asn Lys Ala LeuAla Lys Ile Ala Cys Asp Asn Phe 145 150 155 160 Ala Lys Lys Asn Lys AsnGly Ile Phe Thr Leu Thr Lys Glu Asn Met 165 170 175 Lys Thr Glu Met TrpPro Leu Pro Val Gly Ser Met Phe Gly Val Gly 180 185 190 Ser Arg Met LysHis His Leu Asn Arg Met Gly Ile Ser Thr Ile Gly 195 200 205 Gly Leu AlaAla Phe Pro Leu Asp Leu Leu Lys Lys Lys Trp Gly Ile 210 215 220 Asn GlyHis Val Leu Trp Met Thr Ala Asn Gly Ile Asp Tyr Ser Pro 225 230 235 240Val Ser Thr Ser Ser Leu Asp Gly Gln Lys Ala Ile Gly His Gly Met 245 250255 Thr Leu Pro Arg Asp Tyr Glu His Phe Asp Lys Glu Ile Lys Val Val 260265 270 Leu Leu Glu Leu Ser Glu Glu Val Cys Arg Arg Ser Arg Asn Ala Gly275 280 285 Val Met Gly Gln Thr Val Ser Val Ser Cys Arg Gly Ala Asp PheAsp 290 295 300 Trp Pro Thr Gly Phe Asn Arg Gln Val Lys Leu Ala Glu ProThr Asn 305 310 315 320 Ser Thr Gln Asp Val Tyr Glu Ala Val Arg Arg LeuPhe Leu Thr Phe 325 330 335 Trp Asp Gly Lys Pro Val Arg Arg Leu Gly ValAsn Leu Ser Gln Leu 340 345 350 Ser Ser Asp Asp Ile Trp Gln Leu Asn LeuPhe Gln Asp Tyr Ala Lys 355 360 365 Lys Met Ser Leu Gly Tyr Val Met AspGly Ile Lys Asn Arg Phe Gly 370 375 380 Asp Thr Ala Ile Ile Arg Ala AlaSer Leu Thr Ala Ala Gly Gln Ala 385 390 395 400 Phe Glu Arg Ala Ala LysIle Gly Gly His Tyr Lys 405 410 1452 base pairs nucleic acid singlelinear other nucleic acid /desc = ”DNA (genomic) (p4b3)“ CDS 1..1449 50ATG AGT AAA ATT ATT GGA TCA GAC AGA GTC AAA AGA GGT ATG GCT GAA 48 MetSer Lys Ile Ile Gly Ser Asp Arg Val Lys Arg Gly Met Ala Glu 1 5 10 15ATG CAA AAA GGC GGC GTT ATT ATG GAT GTC GTT AAT GCT GAG CAA GCA 96 MetGln Lys Gly Gly Val Ile Met Asp Val Val Asn Ala Glu Gln Ala 20 25 30 AGAATT GCA GAA GAA GCT GGC GCG GTA GCA GTT ATG GCA TTA GAA CGA 144 Arg IleAla Glu Glu Ala Gly Ala Val Ala Val Met Ala Leu Glu Arg 35 40 45 GTA CCTTCT GAT ATT AGA GCT GCT GGT GGT GTT GCA CGT ATG GCA AAC 192 Val Pro SerAsp Ile Arg Ala Ala Gly Gly Val Ala Arg Met Ala Asn 50 55 60 CCT AAA ATTGTA GAA GAA GTA ATG AAT GCT GTT TCT ATT CCA GTC ATG 240 Pro Lys Ile ValGlu Glu Val Met Asn Ala Val Ser Ile Pro Val Met 65 70 75 80 GCT AAA GCACGT ATT GGT CAT ATC ACT GAA GCA AGA GTA TTA GAG GCG 288 Ala Lys Ala ArgIle Gly His Ile Thr Glu Ala Arg Val Leu Glu Ala 85 90 95 ATG GGT GTT GACTAT ATT GAT GAA TCA GAA GTG TTA ACA CCA GCA GAT 336 Met Gly Val Asp TyrIle Asp Glu Ser Glu Val Leu Thr Pro Ala Asp 100 105 110 GAG GAA TAT CACTTA AGA AAA GAT CAA TTT ACA GTA CCA TTT GTA TGT 384 Glu Glu Tyr His LeuArg Lys Asp Gln Phe Thr Val Pro Phe Val Cys 115 120 125 GGA TGT CGT AATTTA GGT GAA GMT GCG CGT AGA ATT GGT GAA GGT GCT 432 Gly Cys Arg Asn LeuGly Glu Xaa Ala Arg Arg Ile Gly Glu Gly Ala 130 135 140 GCT ATG TTA CGTACT AAA GGT GAA CCA GGT ACA GGT AAT ATT GTT GAA 480 Ala Met Leu Arg ThrLys Gly Glu Pro Gly Thr Gly Asn Ile Val Glu 145 150 155 160 GCT GTA AGACAT ATG AGA CAA GTT AAT TCA GAA GTT AGT CGA TTG ACT 528 Ala Val Arg HisMet Arg Gln Val Asn Ser Glu Val Ser Arg Leu Thr 165 170 175 GTA ATG AATGAT GAT GAG ATT ATG ACT TTT GCG AAA GAT ATC GGT GCG 576 Val Met Asn AspAsp Glu Ile Met Thr Phe Ala Lys Asp Ile Gly Ala 180 185 190 CCT TAT GAAATT TTA AAA CAA ATT AAA GAC AAT GGT CGT TTA CCG GTA 624 Pro Tyr Glu IleLeu Lys Gln Ile Lys Asp Asn Gly Arg Leu Pro Val 195 200 205 GTT AAC TTTGCA GCT GGT GGC GTT GCG ACT CCT CAA GAT GCT GCT TTA 672 Val Asn Phe AlaAla Gly Gly Val Ala Thr Pro Gln Asp Ala Ala Leu 210 215 220 ATG ATG GAATTA GGT GCT GAC GGT GTA TTC GTT GGA TCA GGT ATT TTT 720 Met Met Glu LeuGly Ala Asp Gly Val Phe Val Gly Ser Gly Ile Phe 225 230 235 240 AAA TCAGAA GAT CCA GAA AAA TTT GCT AAA GCA ATT GTT CAA GCA ACA 768 Lys Ser GluAsp Pro Glu Lys Phe Ala Lys Ala Ile Val Gln Ala Thr 245 250 255 ACA CATTAC CAA GAC TAT GAA CTA ATT GGA AGA TTA GCA AGT GAA CTT 816 Thr His TyrGln Asp Tyr Glu Leu Ile Gly Arg Leu Ala Ser Glu Leu 260 265 270 GGC ACTGCT ATG AAA GGT TTA GAT ATC AAT CAA TTA TCA TTA GAA GAA 864 Gly Thr AlaMet Lys Gly Leu Asp Ile Asn Gln Leu Ser Leu Glu Glu 275 280 285 CGT ATGCAA GAG CGT GGT TGG TAA GAT ATG AAA ATA GGT GTA TTA GCA 912 Arg Met GlnGlu Arg Gly Trp Xaa Asp Met Lys Ile Gly Val Leu Ala 290 295 300 TTA CAAGGT GCA GTA CGT GAA CAT ATT AGA CAT ATT GAA TTA AGT GGT 960 Leu Gln GlyAla Val Arg Glu His Ile Arg His Ile Glu Leu Ser Gly 305 310 315 320 CATGAA GGT ATT GCA GTT AAA AAA GTT GAA CAA TTA GAA GAA ATC GAG 1008 His GluGly Ile Ala Val Lys Lys Val Glu Gln Leu Glu Glu Ile Glu 325 330 335 GGCTTA ATA TTA CCT GGT GGC GAG TCT ACA ACG TTA CGT CGA TTA AT G 1056 GlyLeu Ile Leu Pro Gly Gly Glu Ser Thr Thr Leu Arg Arg Leu Met 340 345 350AAT TTA TAT GGA TTT AAA GAG GCT TTA CAA AAT TCA ACT TTA CCT ATG 1104 AsnLeu Tyr Gly Phe Lys Glu Ala Leu Gln Asn Ser Thr Leu Pro Met 355 360 365TTT GGT ACA TGC GCA GGA TTA ATA GTT CTA GCG CAA GAT ATA GTT GGT 1152 PheGly Thr Cys Ala Gly Leu Ile Val Leu Ala Gln Asp Ile Val Gly 370 375 380GAA GAA GGA TAC CTT AAC AAG TTG AAT ATT ACT GTA CAA CGA AAC TCA 1200 GluGlu Gly Tyr Leu Asn Lys Leu Asn Ile Thr Val Gln Arg Asn Ser 385 390 395400 TTC GGT AGA CAA GTT GAC AGC TTT GAA ACA GAA TTA GAT ATT AAA GGT 1248Phe Gly Arg Gln Val Asp Ser Phe Glu Thr Glu Leu Asp Ile Lys Gly 405 410415 ATC GCT ACA GAT ATT GAA GGT GTC TTT ATA AGA GCC CCA CAT ATT GAA 1296Ile Ala Thr Asp Ile Glu Gly Val Phe Ile Arg Ala Pro His Ile Glu 420 425430 AAA GTA GGT CAA GGC GTA GAT ATC CTA TGT AAG GTT AAT GAG AAA ATT 1344Lys Val Gly Gln Gly Val Asp Ile Leu Cys Lys Val Asn Glu Lys Ile 435 440445 GTA GCT GTT CAG CAA GGT AAA TAT TTA GGC GTA TCA TTC CAT CCT GAA 1392Val Ala Val Gln Gln Gly Lys Tyr Leu Gly Val Ser Phe His Pro Glu 450 455460 TTA ACA GAT GAC TAT AGA GTA ACT GAT TAC TTT ATT AAT CAT ATT GTA 1440Leu Thr Asp Asp Tyr Arg Val Thr Asp Tyr Phe Ile Asn His Ile Val 465 470475 480 AAA AAA GCA TAG 1452 Lys Lys Ala 483 amino acids amino acidlinear protein 51 Met Ser Lys Ile Ile Gly Ser Asp Arg Val Lys Arg GlyMet Ala Glu 1 5 10 15 Met Gln Lys Gly Gly Val Ile Met Asp Val Val AsnAla Glu Gln Ala 20 25 30 Arg Ile Ala Glu Glu Ala Gly Ala Val Ala Val MetAla Leu Glu Arg 35 40 45 Val Pro Ser Asp Ile Arg Ala Ala Gly Gly Val AlaArg Met Ala Asn 50 55 60 Pro Lys Ile Val Glu Glu Val Met Asn Ala Val SerIle Pro Val Met 65 70 75 80 Ala Lys Ala Arg Ile Gly His Ile Thr Glu AlaArg Val Leu Glu Ala 85 90 95 Met Gly Val Asp Tyr Ile Asp Glu Ser Glu ValLeu Thr Pro Ala Asp 100 105 110 Glu Glu Tyr His Leu Arg Lys Asp Gln PheThr Val Pro Phe Val Cys 115 120 125 Gly Cys Arg Asn Leu Gly Glu Xaa AlaArg Arg Ile Gly Glu Gly Ala 130 135 140 Ala Met Leu Arg Thr Lys Gly GluPro Gly Thr Gly Asn Ile Val Glu 145 150 155 160 Ala Val Arg His Met ArgGln Val Asn Ser Glu Val Ser Arg Leu Thr 165 170 175 Val Met Asn Asp AspGlu Ile Met Thr Phe Ala Lys Asp Ile Gly Ala 180 185 190 Pro Tyr Glu IleLeu Lys Gln Ile Lys Asp Asn Gly Arg Leu Pro Val 195 200 205 Val Asn PheAla Ala Gly Gly Val Ala Thr Pro Gln Asp Ala Ala Leu 210 215 220 Met MetGlu Leu Gly Ala Asp Gly Val Phe Val Gly Ser Gly Ile Phe 225 230 235 240Lys Ser Glu Asp Pro Glu Lys Phe Ala Lys Ala Ile Val Gln Ala Thr 245 250255 Thr His Tyr Gln Asp Tyr Glu Leu Ile Gly Arg Leu Ala Ser Glu Leu 260265 270 Gly Thr Ala Met Lys Gly Leu Asp Ile Asn Gln Leu Ser Leu Glu Glu275 280 285 Arg Met Gln Glu Arg Gly Trp Xaa Asp Met Lys Ile Gly Val LeuAla 290 295 300 Leu Gln Gly Ala Val Arg Glu His Ile Arg His Ile Glu LeuSer Gly 305 310 315 320 His Glu Gly Ile Ala Val Lys Lys Val Glu Gln LeuGlu Glu Ile Glu 325 330 335 Gly Leu Ile Leu Pro Gly Gly Glu Ser Thr ThrLeu Arg Arg Leu Met 340 345 350 Asn Leu Tyr Gly Phe Lys Glu Ala Leu GlnAsn Ser Thr Leu Pro Met 355 360 365 Phe Gly Thr Cys Ala Gly Leu Ile ValLeu Ala Gln Asp Ile Val Gly 370 375 380 Glu Glu Gly Tyr Leu Asn Lys LeuAsn Ile Thr Val Gln Arg Asn Ser 385 390 395 400 Phe Gly Arg Gln Val AspSer Phe Glu Thr Glu Leu Asp Ile Lys Gly 405 410 415 Ile Ala Thr Asp IleGlu Gly Val Phe Ile Arg Ala Pro His Ile Glu 420 425 430 Lys Val Gly GlnGly Val Asp Ile Leu Cys Lys Val Asn Glu Lys Ile 435 440 445 Val Ala ValGln Gln Gly Lys Tyr Leu Gly Val Ser Phe His Pro Glu 450 455 460 Leu ThrAsp Asp Tyr Arg Val Thr Asp Tyr Phe Ile Asn His Ile Val 465 470 475 480Lys Lys Ala 977 base pairs nucleic acid single linear other nucleic acid/desc = ”DNA (genomic) (p4c63)“ CDS 1..975 52 AAC AAA GCC TTC CAA TTATCT GCG TCG GTA GAA CAA GTA TTA GCA ACT 48 Asn Lys Ala Phe Gln Leu SerAla Ser Val Glu Gln Val Leu Ala Thr 1 5 10 15 TTA TCA CCT ACG CTA AACAGT CCT TAC GAT TTA TAC GGC ACG ACA AAA 96 Leu Ser Pro Thr Leu Asn SerPro Tyr Asp Leu Tyr Gly Thr Thr Lys 20 25 30 ATG CTA GAT ATT ACA TTC GATTCA TTT GAA CAT GAT GGT ACA ACG TAC 144 Met Leu Asp Ile Thr Phe Asp SerPhe Glu His Asp Gly Thr Thr Tyr 35 40 45 CCT GTC GAC TAT GCT ACG TTT GAAAAT GAT TAT GAA GAT AAT AAA GAT 192 Pro Val Asp Tyr Ala Thr Phe Glu AsnAsp Tyr Glu Asp Asn Lys Asp 50 55 60 CCT GAG TTT AGA CGT AAA AGT TTC AAATCG TTT AGC GAT GGG ATT CGA 240 Pro Glu Phe Arg Arg Lys Ser Phe Lys SerPhe Ser Asp Gly Ile Arg 65 70 75 80 AAA TAT CAG CAT ACT ACC GCG GCT ACATAT AAT ATG CAA GTA CAA CAA 288 Lys Tyr Gln His Thr Thr Ala Ala Thr TyrAsn Met Gln Val Gln Gln 85 90 95 GAA AAA ATT GAA GCT GAT TTA CGT GGA TTTGAA TCA GTC ATC GAT TAT 336 Glu Lys Ile Glu Ala Asp Leu Arg Gly Phe GluSer Val Ile Asp Tyr 100 105 110 TTA TTA CAT AGT CAA GAA GTA ACG CGT GATATG TTT GAC CGT CAA ATC 384 Leu Leu His Ser Gln Glu Val Thr Arg Asp MetPhe Asp Arg Gln Ile 115 120 125 GAT ATG ATT ATG CGT GAC TTG GCA CCA GTTATG CAG AAA TAT GCT AAA 432 Asp Met Ile Met Arg Asp Leu Ala Pro Val MetGln Lys Tyr Ala Lys 130 135 140 CTT TTA CAA CGT ATT CAC GGA TTA GAT AACATG CGT TTT GAA GAC TTG 480 Leu Leu Gln Arg Ile His Gly Leu Asp Asn MetArg Phe Glu Asp Leu 145 150 155 160 AAG ATT TCT GTA GAC CCT GAT TAT GAACCA GAG ATT TCA ATT GAA GAC 528 Lys Ile Ser Val Asp Pro Asp Tyr Glu ProGlu Ile Ser Ile Glu Asp 165 170 175 TCA AAA AAT TAT ATT TTC GGT GCG TTAAGT GTT TTA GGT GAT GAC TAT 576 Ser Lys Asn Tyr Ile Phe Gly Ala Leu SerVal Leu Gly Asp Asp Tyr 180 185 190 ACA AAC ATG TTA CGT GAA GCA TAC GATCAG CGA TGG ATT GAT TTT GCA 624 Thr Asn Met Leu Arg Glu Ala Tyr Asp GlnArg Trp Ile Asp Phe Ala 195 200 205 CAA AAT AAA GGT AAA GAT ACA GGC GCATTT TGT GCA AGT CCA TAC TTT 672 Gln Asn Lys Gly Lys Asp Thr Gly Ala PheCys Ala Ser Pro Tyr Phe 210 215 220 ACA CAT TCA TAT GTG TTT ATT TCT TGGACT GGT AAA ATG GCT GAA GCA 720 Thr His Ser Tyr Val Phe Ile Ser Trp ThrGly Lys Met Ala Glu Ala 225 230 235 240 TTT GTC TTA GCA CAT GAA TTA GGTCAT GCA GGT CAT TTT ACA TTA GCT 768 Phe Val Leu Ala His Glu Leu Gly HisAla Gly His Phe Thr Leu Ala 245 250 255 CAA AAA CAT CAA CCA TAT CTT GAATCA GAA GCA TCA ATG TAC TTT GTT 816 Gln Lys His Gln Pro Tyr Leu Glu SerGlu Ala Ser Met Tyr Phe Val 260 265 270 GAA GCC CCT TCT ACA ATG AAT GAAATG TTG ATG GCC AAT TAT TTA TTT 864 Glu Ala Pro Ser Thr Met Asn Glu MetLeu Met Ala Asn Tyr Leu Phe 275 280 285 AAC ACA AGT GAT AAT CCA AGA TTTAAG CGT TGG GTT ATT GGC TCA ATT 912 Asn Thr Ser Asp Asn Pro Arg Phe LysArg Trp Val Ile Gly Ser Ile 290 295 300 TTA TCT AGA ACA TAT TAT CAT AATATG GTA CCC MTT TAT TAG AAG CNG 960 Leu Ser Arg Thr Tyr Tyr His Asn MetVal Pro Xaa Tyr Xaa Lys Xaa 305 310 315 320 CTT ATC CAC GGG GAG TG 977Leu Ile His Gly Glu 325 325 amino acids amino acid linear protein 53 AsnLys Ala Phe Gln Leu Ser Ala Ser Val Glu Gln Val Leu Ala Thr 1 5 10 15Leu Ser Pro Thr Leu Asn Ser Pro Tyr Asp Leu Tyr Gly Thr Thr Lys 20 25 30Met Leu Asp Ile Thr Phe Asp Ser Phe Glu His Asp Gly Thr Thr Tyr 35 40 45Pro Val Asp Tyr Ala Thr Phe Glu Asn Asp Tyr Glu Asp Asn Lys Asp 50 55 60Pro Glu Phe Arg Arg Lys Ser Phe Lys Ser Phe Ser Asp Gly Ile Arg 65 70 7580 Lys Tyr Gln His Thr Thr Ala Ala Thr Tyr Asn Met Gln Val Gln Gln 85 9095 Glu Lys Ile Glu Ala Asp Leu Arg Gly Phe Glu Ser Val Ile Asp Tyr 100105 110 Leu Leu His Ser Gln Glu Val Thr Arg Asp Met Phe Asp Arg Gln Ile115 120 125 Asp Met Ile Met Arg Asp Leu Ala Pro Val Met Gln Lys Tyr AlaLys 130 135 140 Leu Leu Gln Arg Ile His Gly Leu Asp Asn Met Arg Phe GluAsp Leu 145 150 155 160 Lys Ile Ser Val Asp Pro Asp Tyr Glu Pro Glu IleSer Ile Glu Asp 165 170 175 Ser Lys Asn Tyr Ile Phe Gly Ala Leu Ser ValLeu Gly Asp Asp Tyr 180 185 190 Thr Asn Met Leu Arg Glu Ala Tyr Asp GlnArg Trp Ile Asp Phe Ala 195 200 205 Gln Asn Lys Gly Lys Asp Thr Gly AlaPhe Cys Ala Ser Pro Tyr Phe 210 215 220 Thr His Ser Tyr Val Phe Ile SerTrp Thr Gly Lys Met Ala Glu Ala 225 230 235 240 Phe Val Leu Ala His GluLeu Gly His Ala Gly His Phe Thr Leu Ala 245 250 255 Gln Lys His Gln ProTyr Leu Glu Ser Glu Ala Ser Met Tyr Phe Val 260 265 270 Glu Ala Pro SerThr Met Asn Glu Met Leu Met Ala Asn Tyr Leu Phe 275 280 285 Asn Thr SerAsp Asn Pro Arg Phe Lys Arg Trp Val Ile Gly Ser Ile 290 295 300 Leu SerArg Thr Tyr Tyr His Asn Met Val Pro Xaa Tyr Xaa Lys Xaa 305 310 315 320Leu Ile His Gly Glu 325 471 base pairs nucleic acid single linear othernucleic acid /desc = ”DNA (genomic) (p5c3)“ CDS 1..465 54 ATG TAT CAACTA CAA TTT ATA AAT TTA GTT TAC GAC ACA ACC AAA CTC 48 Met Tyr Gln LeuGln Phe Ile Asn Leu Val Tyr Asp Thr Thr Lys Leu 1 5 10 15 ACA CAT CTAGAA CAA ACC AAT ATC AAT TTA TTC ATT GGT AAT TGG AGT 96 Thr His Leu GluGln Thr Asn Ile Asn Leu Phe Ile Gly Asn Trp Ser 20 25 30 AAT CAT CAA TTACAA AAA TCA ATT TGT ATA CGT CAT GGC GAT GAT ACA 144 Asn His Gln Leu GlnLys Ser Ile Cys Ile Arg His Gly Asp Asp Thr 35 40 45 AGT CAC AAT CAA TATCAT ATT CTT TTT ATA GAT ACG GCA CAT CAA CGC 192 Ser His Asn Gln Tyr HisIle Leu Phe Ile Asp Thr Ala His Gln Arg 50 55 60 ATT AAA TTT TCA TCT ATTGAT AAT GAA GAA ATC ATT TAT ATT CTT GAT 240 Ile Lys Phe Ser Ser Ile AspAsn Glu Glu Ile Ile Tyr Ile Leu Asp 65 70 75 80 TAT GAT GAT ACA CAG CATATC CTC ATG CAA ACG TCA TCC AAA CAA GGT 288 Tyr Asp Asp Thr Gln His IleLeu Met Gln Thr Ser Ser Lys Gln Gly 85 90 95 ATT GGC ACT TCG CGA CCA ATCGTT TAT GAG CGC TTA GTA TAA CTA ATT 336 Ile Gly Thr Ser Arg Pro Ile ValTyr Glu Arg Leu Val Xaa Leu Ile 100 105 110 TAA ATG ATT TCA CTT CAT AAAGCG GGT TGG CGA GAA TTC AAT TTC TCA 384 Xaa Met Ile Ser Leu His Lys AlaGly Trp Arg Glu Phe Asn Phe Ser 115 120 125 CCA GCT CGT TTT TTC ATT GTAATA ATA ATC TTT AAC ATT TAT TCT TTC 432 Pro Ala Arg Phe Phe Ile Val IleIle Ile Phe Asn Ile Tyr Ser Phe 130 135 140 TCT ATT AAT TTT TCT CAA ACTATC TTA TCT TTA TGATAA 471 Ser Ile Asn Phe Ser Gln Thr Ile Leu Ser Leu145 150 155 155 amino acids amino acid linear protein 55 Met Tyr Gln LeuGln Phe Ile Asn Leu Val Tyr Asp Thr Thr Lys Leu 1 5 10 15 Thr His LeuGlu Gln Thr Asn Ile Asn Leu Phe Ile Gly Asn Trp Ser 20 25 30 Asn His GlnLeu Gln Lys Ser Ile Cys Ile Arg His Gly Asp Asp Thr 35 40 45 Ser His AsnGln Tyr His Ile Leu Phe Ile Asp Thr Ala His Gln Arg 50 55 60 Ile Lys PheSer Ser Ile Asp Asn Glu Glu Ile Ile Tyr Ile Leu Asp 65 70 75 80 Tyr AspAsp Thr Gln His Ile Leu Met Gln Thr Ser Ser Lys Gln Gly 85 90 95 Ile GlyThr Ser Arg Pro Ile Val Tyr Glu Arg Leu Val Xaa Leu Ile 100 105 110 XaaMet Ile Ser Leu His Lys Ala Gly Trp Arg Glu Phe Asn Phe Ser 115 120 125Pro Ala Arg Phe Phe Ile Val Ile Ile Ile Phe Asn Ile Tyr Ser Phe 130 135140 Ser Ile Asn Phe Ser Gln Thr Ile Leu Ser Leu 145 150 155 588 basepairs nucleic acid single linear other nucleic acid /desc = ”DNA(genomic) (p8d26)“ CDS 14..343 56 ATTTNTAGAA TAA TAT TTC CAT ATT GGA AAAAGG GAA GAA TTC GTT ATG 49 Tyr Phe His Ile Gly Lys Arg Glu Glu Phe ValMet 1 5 10 AAA AGC TAT AAG TGT AAA GGT TCA TTC TTA ATA GAT AGT ATG GCTGGA 97 Lys Ser Tyr Lys Cys Lys Gly Ser Phe Leu Ile Asp Ser Met Ala Gly15 20 25 TTT TTG CTA ATT GGA TTG ATN ACA TTA CTA TTG ATA CCA ATG ATG AAT145 Phe Leu Leu Ile Gly Leu Xaa Thr Leu Leu Leu Ile Pro Met Met Asn 3035 40 CAA ATG CAA GCG AGT ATA AAC CAT AAA CTA CAA ACA ATT GAT GCT TCT193 Gln Met Gln Ala Ser Ile Asn His Lys Leu Gln Thr Ile Asp Ala Ser 4550 55 60 AAA GTA ATT TTG ACG ACT GTA TCT AAA ATT AAT AAA GAA GAA CTT AAG241 Lys Val Ile Leu Thr Thr Val Ser Lys Ile Asn Lys Glu Glu Leu Lys 6570 75 AAG GGG GTA ACT ATA GGG AAG TAT GAT ATT AAG CAA AGT GAC CAA CAA289 Lys Gly Val Thr Ile Gly Lys Tyr Asp Ile Lys Gln Ser Asp Gln Gln 8085 90 ATT TGT GCT ATT TCA ANA AAT ACC ANT TCT TAT CAA AAG ACA TGT ATA337 Ile Cys Ala Ile Ser Xaa Asn Thr Xaa Ser Tyr Gln Lys Thr Cys Ile 95100 105 CAG TAT AAATGTCAAA GCTTTTTCGC TCATTGAAAT GTTAGTAGCG ATGATGGTTA393 Gln Tyr 110 TAAGTATAAC TTTACTAATT GTTCCAGACT TAATTAGACT TAGTAAAACTTTTCTAATTG 453 AAAGTAGGGA TTTAACAACT GTAGATTTCG AATTTTTCTC AAGAGATATTCTAGATGATT 513 TTAAAGGAGT AGATAGAAAC GATATTGAAA TTAGGCAACA CCGTATCATTGTACATAAAG 573 GTGAATAAAA ATGGC 588 110 amino acids amino acid linearprotein 57 Tyr Phe His Ile Gly Lys Arg Glu Glu Phe Val Met Lys Ser TyrLys 1 5 10 15 Cys Lys Gly Ser Phe Leu Ile Asp Ser Met Ala Gly Phe LeuLeu Ile 20 25 30 Gly Leu Xaa Thr Leu Leu Leu Ile Pro Met Met Asn Gln MetGln Ala 35 40 45 Ser Ile Asn His Lys Leu Gln Thr Ile Asp Ala Ser Lys ValIle Leu 50 55 60 Thr Thr Val Ser Lys Ile Asn Lys Glu Glu Leu Lys Lys GlyVal Thr 65 70 75 80 Ile Gly Lys Tyr Asp Ile Lys Gln Ser Asp Gln Gln IleCys Ala Ile 85 90 95 Ser Xaa Asn Thr Xaa Ser Tyr Gln Lys Thr Cys Ile GlnTyr 100 105 110 588 base pairs nucleic acid single linear other nucleicacid /desc = ”DNA (genomic) (p8d26)“ CDS 318..587 58 ATTTATAGAATAATATTTCC ATATTGGAAA AAGGGAAGAA TTCGTTATGA AAAGCTATAA 60 GTGTAAAGGTTCATTCTTAA TAGATAGTAT GGCTGGATTT TTGCTAATTG GATTGATAAC 120 ATTACTATTGATACCAATGA TGAATCAAAT GCAAGCGAGT ATAAACCATA AACTACAAAC 180 AATTGATGCTTCTAAAGTAA TTTTGACGAC TGTATCTAAA ATTAATAAAG AAGAACTTAA 240 GAAGGGGGTAACTATAGGGA AGTATGATAT TAAGCAAAGT GACCAACAAA TTTGTGCTAT 300 TTCAAAAAATACCAATT CTT ATC AAA AGA CAT GTA TAC AGT ATA AAT GTC 350 Leu Ile Lys ArgHis Val Tyr Ser Ile Asn Val 1 5 10 AAA GCT TTT TCG CTC ATT GAA ATG TTAGTA GCG ATG ATG GTT ATA AGT 398 Lys Ala Phe Ser Leu Ile Glu Met Leu ValAla Met Met Val Ile Ser 15 20 25 ATA ACT TTA CTA ATT GTT CCA GAC TTA ATTAGA CTT AGT AAA ACT TTT 446 Ile Thr Leu Leu Ile Val Pro Asp Leu Ile ArgLeu Ser Lys Thr Phe 30 35 40 CTA ATT GAA AGT AGG GAT TTA ACA ACT GTA GATTTC GAA TTT TTC TCA 494 Leu Ile Glu Ser Arg Asp Leu Thr Thr Val Asp PheGlu Phe Phe Ser 45 50 55 AGA GAT ATT CTA GAT GAT TTT AAA GGA GTA GAT AGAAAC GAT ATT GAA 542 Arg Asp Ile Leu Asp Asp Phe Lys Gly Val Asp Arg AsnAsp Ile Glu 60 65 70 75 ATT AGG CAA CAC CGT ATC ATT GTA CAT AAA GGT GNNTAA AAA TGG 587 Ile Arg Gln His Arg Ile Ile Val His Lys Gly Xaa Xaa LysTrp 80 85 90 C 588 90 amino acids amino acid linear protein 59 Leu IleLys Arg His Val Tyr Ser Ile Asn Val Lys Ala Phe Ser Leu 1 5 10 15 IleGlu Met Leu Val Ala Met Met Val Ile Ser Ile Thr Leu Leu Ile 20 25 30 ValPro Asp Leu Ile Arg Leu Ser Lys Thr Phe Leu Ile Glu Ser Arg 35 40 45 AspLeu Thr Thr Val Asp Phe Glu Phe Phe Ser Arg Asp Ile Leu Asp 50 55 60 AspPhe Lys Gly Val Asp Arg Asn Asp Ile Glu Ile Arg Gln His Arg 65 70 75 80Ile Ile Val His Lys Gly Xaa Xaa Lys Trp 85 90 1338 base pairs nucleicacid single linear other nucleic acid /desc = ”DNA (genomic) (p9b65)“CDS 1..1335 60 ATG ATC CGG CTT GTG ACG ATG GGG AAA AGT TCT GAG GCC GGGGTG TCT 48 Met Ile Arg Leu Val Thr Met Gly Lys Ser Ser Glu Ala Gly ValSer 1 5 10 15 TCT TTC CAG GCG TTG ACG ATG TCG TTG TCC GGC CGC ATC GGCGTC GGG 96 Ser Phe Gln Ala Leu Thr Met Ser Leu Ser Gly Arg Ile Gly ValGly 20 25 30 AAC GTG GCC GGG ACG GCG ACG GGG ATT GCG TAC GGG GGG CCG GGCGCG 144 Asn Val Ala Gly Thr Ala Thr Gly Ile Ala Tyr Gly Gly Pro Gly Ala35 40 45 GTC TTT TGG ATG TGG GTG ATC ACC TTT ATC GGA GCG GCG ACC GCG TAT192 Val Phe Trp Met Trp Val Ile Thr Phe Ile Gly Ala Ala Thr Ala Tyr 5055 60 GTC GAG TCG ACG TGG CGC AAA TTT ATA AAG AGG AAC AAG ACG GAC AAT240 Val Glu Ser Thr Trp Arg Lys Phe Ile Lys Arg Asn Lys Thr Asp Asn 6570 75 80 ACC GTG GCG GTC CGG CGT TCT ACA TTG AAA AAG GCC TTG GCT GGA AAT288 Thr Val Ala Val Arg Arg Ser Thr Leu Lys Lys Ala Leu Ala Gly Asn 8590 95 GGT TTG CGG TGT AGT CGC GCG GCG ATC ATT CTC TCG ATG GCG GTG CTG336 Gly Leu Arg Cys Ser Arg Ala Ala Ile Ile Leu Ser Met Ala Val Leu 100105 110 ATG CCG GGA ATT CAA GCA AAC TCG ATT GCC GAC AGC TTT TCG AAT GCG384 Met Pro Gly Ile Gln Ala Asn Ser Ile Ala Asp Ser Phe Ser Asn Ala 115120 125 TTT GGC ATT CCG AAA TTG GTG ACG GGA ATT TTC GTG ATT GCC GTT CTT432 Phe Gly Ile Pro Lys Leu Val Thr Gly Ile Phe Val Ile Ala Val Leu 130135 140 GGC TTT ACG ATT TTT GGC GGA GTG AAG CGG ATC GCG AAA ACG GCG GAA480 Gly Phe Thr Ile Phe Gly Gly Val Lys Arg Ile Ala Lys Thr Ala Glu 145150 155 160 ATT GTC GTG CCG TTT ATG GCA GTT GGC TAT TTG TTC GTC GCG ATTGCC 528 Ile Val Val Pro Phe Met Ala Val Gly Tyr Leu Phe Val Ala Ile Ala165 170 175 ATT ATT GCG GCC AAT ATT GAA AAA GTC CCG GAT GTG TTT GGT TTGATT 576 Ile Ile Ala Ala Asn Ile Glu Lys Val Pro Asp Val Phe Gly Leu Ile180 185 190 TTC AAA AGC GCG TTT GGC GCT GAT CAA GTG TTT GGC GGC ATT CTTGGT 624 Phe Lys Ser Ala Phe Gly Ala Asp Gln Val Phe Gly Gly Ile Leu Gly195 200 205 TCG GCG GTG ATG TGG GGG GTC AAA CGC GGC CTT TAT GCG AAT GAAGCG 672 Ser Ala Val Met Trp Gly Val Lys Arg Gly Leu Tyr Ala Asn Glu Ala210 215 220 GGG CAA GGG ACG GGC GCC CAC CCG GCA GCG GCG GCG GAA GTG TCCCAC 720 Gly Gln Gly Thr Gly Ala His Pro Ala Ala Ala Ala Glu Val Ser His225 230 235 240 CCG GCG AAG CAG GGG CTT GTG CAG GCA TTT TCG ATC TAT TTGGAC GTG 768 Pro Ala Lys Gln Gly Leu Val Gln Ala Phe Ser Ile Tyr Leu AspVal 245 250 255 TTC TTG GTC GTG ACG GCG ACG GCG CTG ATG ATT TTG TTT ACGGGT CAA 816 Phe Leu Val Val Thr Ala Thr Ala Leu Met Ile Leu Phe Thr GlyGln 260 265 270 TAC AAT GTG ATC AAT GAA AAA ACG GGA GAG ACG ATT GTC GAGCAT TTG 864 Tyr Asn Val Ile Asn Glu Lys Thr Gly Glu Thr Ile Val Glu HisLeu 275 280 285 AAA GGG GTG GAA CCG GGC GCA GGG TAT ACG CAG GCG GCG GTGGAC ACG 912 Lys Gly Val Glu Pro Gly Ala Gly Tyr Thr Gln Ala Ala Val AspThr 290 295 300 CTC TTC CCG GGA TTC GGG TCG GCC TTT ATT GCG ATC GCT CTGTTC TTC 960 Leu Phe Pro Gly Phe Gly Ser Ala Phe Ile Ala Ile Ala Leu PhePhe 305 310 315 320 TTC GCG TTT ACG ACG ATG TAC GCG TAT TAC TAT ATT GCCGAG ACG AAC 1008 Phe Ala Phe Thr Thr Met Tyr Ala Tyr Tyr Tyr Ile Ala GluThr Asn 325 330 335 CTC GCC TAT TTG GTG CGC AGT GAA AAG AGG GGA ACG GCCTTC TTT GCC 1056 Leu Ala Tyr Leu Val Arg Ser Glu Lys Arg Gly Thr Ala PhePhe Ala 340 345 350 TTG AAG CTC GTC TTT TTG GCG GCC ACG TTC TAT GGA ACGGTC AAA ACG 1104 Leu Lys Leu Val Phe Leu Ala Ala Thr Phe Tyr Gly Thr ValLys Thr 355 360 365 GCG ACG ACG GCG TGG GCG ATG GGC GAC ATC GGG CTT GGCATC ATG GTG 1152 Ala Thr Thr Ala Trp Ala Met Gly Asp Ile Gly Leu Gly IleMet Val 370 375 380 TGG CTC AAC TTG ATT GCG ATC TTG TTG TTG TTT AAA CCGGCC TAT ATG 1200 Trp Leu Asn Leu Ile Ala Ile Leu Leu Leu Phe Lys Pro AlaTyr Met 385 390 395 400 GCC TTG AAA GAT TAT GAA GAA CAG CTG AAG CAA GGGAAA GAT CCG GAG 1248 Ala Leu Lys Asp Tyr Glu Glu Gln Leu Lys Gln Gly LysAsp Pro Glu 405 410 415 TTC AAC GCG TCG AAA TAC GGA ATC AAG AAC GCG AAATTC TGG GAA AAT 1296 Phe Asn Ala Ser Lys Tyr Gly Ile Lys Asn Ala Lys PheTrp Glu Asn 420 425 430 GGA TAT AAG AGA TGG GAA GAA AAG AAA GGG AAG GCATTG TAA 1338 Gly Tyr Lys Arg Trp Glu Glu Lys Lys Gly Lys Ala Leu 435 440445 445 amino acids amino acid linear protein 61 Met Ile Arg Leu Val ThrMet Gly Lys Ser Ser Glu Ala Gly Val Ser 1 5 10 15 Ser Phe Gln Ala LeuThr Met Ser Leu Ser Gly Arg Ile Gly Val Gly 20 25 30 Asn Val Ala Gly ThrAla Thr Gly Ile Ala Tyr Gly Gly Pro Gly Ala 35 40 45 Val Phe Trp Met TrpVal Ile Thr Phe Ile Gly Ala Ala Thr Ala Tyr 50 55 60 Val Glu Ser Thr TrpArg Lys Phe Ile Lys Arg Asn Lys Thr Asp Asn 65 70 75 80 Thr Val Ala ValArg Arg Ser Thr Leu Lys Lys Ala Leu Ala Gly Asn 85 90 95 Gly Leu Arg CysSer Arg Ala Ala Ile Ile Leu Ser Met Ala Val Leu 100 105 110 Met Pro GlyIle Gln Ala Asn Ser Ile Ala Asp Ser Phe Ser Asn Ala 115 120 125 Phe GlyIle Pro Lys Leu Val Thr Gly Ile Phe Val Ile Ala Val Leu 130 135 140 GlyPhe Thr Ile Phe Gly Gly Val Lys Arg Ile Ala Lys Thr Ala Glu 145 150 155160 Ile Val Val Pro Phe Met Ala Val Gly Tyr Leu Phe Val Ala Ile Ala 165170 175 Ile Ile Ala Ala Asn Ile Glu Lys Val Pro Asp Val Phe Gly Leu Ile180 185 190 Phe Lys Ser Ala Phe Gly Ala Asp Gln Val Phe Gly Gly Ile LeuGly 195 200 205 Ser Ala Val Met Trp Gly Val Lys Arg Gly Leu Tyr Ala AsnGlu Ala 210 215 220 Gly Gln Gly Thr Gly Ala His Pro Ala Ala Ala Ala GluVal Ser His 225 230 235 240 Pro Ala Lys Gln Gly Leu Val Gln Ala Phe SerIle Tyr Leu Asp Val 245 250 255 Phe Leu Val Val Thr Ala Thr Ala Leu MetIle Leu Phe Thr Gly Gln 260 265 270 Tyr Asn Val Ile Asn Glu Lys Thr GlyGlu Thr Ile Val Glu His Leu 275 280 285 Lys Gly Val Glu Pro Gly Ala GlyTyr Thr Gln Ala Ala Val Asp Thr 290 295 300 Leu Phe Pro Gly Phe Gly SerAla Phe Ile Ala Ile Ala Leu Phe Phe 305 310 315 320 Phe Ala Phe Thr ThrMet Tyr Ala Tyr Tyr Tyr Ile Ala Glu Thr Asn 325 330 335 Leu Ala Tyr LeuVal Arg Ser Glu Lys Arg Gly Thr Ala Phe Phe Ala 340 345 350 Leu Lys LeuVal Phe Leu Ala Ala Thr Phe Tyr Gly Thr Val Lys Thr 355 360 365 Ala ThrThr Ala Trp Ala Met Gly Asp Ile Gly Leu Gly Ile Met Val 370 375 380 TrpLeu Asn Leu Ile Ala Ile Leu Leu Leu Phe Lys Pro Ala Tyr Met 385 390 395400 Ala Leu Lys Asp Tyr Glu Glu Gln Leu Lys Gln Gly Lys Asp Pro Glu 405410 415 Phe Asn Ala Ser Lys Tyr Gly Ile Lys Asn Ala Lys Phe Trp Glu Asn420 425 430 Gly Tyr Lys Arg Trp Glu Glu Lys Lys Gly Lys Ala Leu 435 440445 296 base pairs nucleic acid single linear other nucleic acid /desc =”DNA (genomic) (p10b32)“ CDS 1..294 62 ATG CAA ATA GAA CTT ACT GAT GCAGCA GTA ACT TGG TTT AAA AAT GAA 48 Met Gln Ile Glu Leu Thr Asp Ala AlaVal Thr Trp Phe Lys Asn Glu 1 5 10 15 CTT GAG TTG CCT GAA AAT AAT AAAGTG CTC GTG TTT TTT GTA AGA TAT 96 Leu Glu Leu Pro Glu Asn Asn Lys ValLeu Val Phe Phe Val Arg Tyr 20 25 30 GGT GGC GAA TTC CAA CTC AAG CAA GGATTT AGT CCT GCT TTT ACA GTT 144 Gly Gly Glu Phe Gln Leu Lys Gln Gly PheSer Pro Ala Phe Thr Val 35 40 45 GAA CCA AAG GAA GAT GTT GAT ATT GGC TATGAA CAA CAA TAT GAC GAT 192 Glu Pro Lys Glu Asp Val Asp Ile Gly Tyr GluGln Gln Tyr Asp Asp 50 55 60 TTA AAT GTT GTC GTA GCG GAA AAA GAT TTG TGGTAC TTT GAA GAT GAC 240 Leu Asn Val Val Val Ala Glu Lys Asp Leu Trp TyrPhe Glu Asp Asp 65 70 75 80 CAC ATT ATT GTA AAT GTA GTT GTC ACG AAG ATGAAT TTC TTA TTC CAC 288 His Ile Ile Val Asn Val Val Val Thr Lys Met AsnPhe Leu Phe His 85 90 95 CAA ATA AC 296 Gln Ile 98 amino acids aminoacid linear protein 63 Met Gln Ile Glu Leu Thr Asp Ala Ala Val Thr TrpPhe Lys Asn Glu 1 5 10 15 Leu Glu Leu Pro Glu Asn Asn Lys Val Leu ValPhe Phe Val Arg Tyr 20 25 30 Gly Gly Glu Phe Gln Leu Lys Gln Gly Phe SerPro Ala Phe Thr Val 35 40 45 Glu Pro Lys Glu Asp Val Asp Ile Gly Tyr GluGln Gln Tyr Asp Asp 50 55 60 Leu Asn Val Val Val Ala Glu Lys Asp Leu TrpTyr Phe Glu Asp Asp 65 70 75 80 His Ile Ile Val Asn Val Val Val Thr LysMet Asn Phe Leu Phe His 85 90 95 Gln Ile 433 base pairs nucleic acidsingle linear other nucleic acid /desc = ”DNA(genomic) (p10b85)“ 64GNAACACTNC TNTTTCATTT GTGAATAATT TCACAATTAT TTTATCCTTT GGTTATGAAT 60ATGTAAATAC CATTAATTAA TTATTCATAT TTTATCTATA ATTTGTTAAT AAATATTTAA 120TATATTATNN TTATNCTTAA ATATTANGNG CAATTATACC ACTATATTTA TTTCTTATAC 180TCCATTTGAT TTATAGTTGT ATTAATACAT TGACTCAAAA ACTAATTAAT CAAATATGTT 240TTTTAGATTN ATAAAGTTGT AACTGTACTA TTTNGTAGTG TANGGTAATT TATTNGGATG 300AAATATAATT CNNNNTACTC TATNGATTAA TCAAATATGT ATCTATCAAA ATTCGGCTTA 360TTTATNCACT CTGACATATC CANATACGCA AAAAGACTAT NNCTACCTTG TATCGANAGA 420AATAGTCTTT TTA 433 208 base pairs nucleic acid single linear othernucleic acid /desc = ”DNA (genomic) (p10b89)“ CDS 1..207 65 TAC TAT AAAAGC ACC CAC TCA GTC ACT AGT TTG GGC AGT TAT TGT ATG 48 Tyr Tyr Lys SerThr His Ser Val Thr Ser Leu Gly Ser Tyr Cys Met 1 5 10 15 CCT ATT GAACTC AAT GCG TAT ATT ACA ATA CCT TTT TCG CAT ATT CAT 96 Pro Ile Glu LeuAsn Ala Tyr Ile Thr Ile Pro Phe Ser His Ile His 20 25 30 ATA AGA TCT TTGCAT TCN TTA AGC TTA AAT TTT CTA TTC TTC NTT CTC 144 Ile Arg Ser Leu HisSer Leu Ser Leu Asn Phe Leu Phe Phe Xaa Leu 35 40 45 TAC GGC GGC ATA GCATTA ATA TTA CCG GAA CTA ATC CCA GTA NCN GTA 192 Tyr Gly Gly Ile Ala LeuIle Leu Pro Glu Leu Ile Pro Val Xaa Val 50 55 60 TTA ATT GGA TAC CCG G208 Leu Ile Gly Tyr Pro 65 69 amino acids amino acid linear protein 66Tyr Tyr Lys Ser Thr His Ser Val Thr Ser Leu Gly Ser Tyr Cys Met 1 5 1015 Pro Ile Glu Leu Asn Ala Tyr Ile Thr Ile Pro Phe Ser His Ile His 20 2530 Ile Arg Ser Leu His Ser Leu Ser Leu Asn Phe Leu Phe Phe Xaa Leu 35 4045 Tyr Gly Gly Ile Ala Leu Ile Leu Pro Glu Leu Ile Pro Val Xaa Val 50 5560 Leu Ile Gly Tyr Pro 65 1041 base pairs nucleic acid single linearother nucleic acid /desc = ”DNA (genomic) (p10c30)“ CDS 1..1038 67 AGAAGT CAA ATC ATT ANT GGC GTC NTA TCG AGT ATA CTA TTA ACT TCA 48 Arg SerGln Ile Ile Xaa Gly Val Xaa Ser Ser Ile Leu Leu Thr Ser 1 5 10 15 ACTATT TTA GCA ATT GCA TAT ATT TTA ATG TGG TTT AAC GGC CAT ATG 96 Thr IleLeu Ala Ile Ala Tyr Ile Leu Met Trp Phe Asn Gly His Met 20 25 30 ACA CTAACT TTG ACC TTA ACG ACA ATA ATT ACA AGC TGT TTA ACC TTA 144 Thr Leu ThrLeu Thr Leu Thr Thr Ile Ile Thr Ser Cys Leu Thr Leu 35 40 45 TTA ATA TGTAGT ATT TTT ATT AAT CCA CTT ATA CAA AAA ATT AAG CAG 192 Leu Ile Cys SerIle Phe Ile Asn Pro Leu Ile Gln Lys Ile Lys Gln 50 55 60 TTT AAT ATA AAAACT AAG CAA TTT GCT AAC GGA AAT TAC GCA AGC AAT 240 Phe Asn Ile Lys ThrLys Gln Phe Ala Asn Gly Asn Tyr Ala Ser Asn 65 70 75 80 GAT AAA ACG TTTAAT TCA CCA AAA GAA ATT TAT GAA TTA AAT CAA TCT 288 Asp Lys Thr Phe AsnSer Pro Lys Glu Ile Tyr Glu Leu Asn Gln Ser 85 90 95 TTT AAT AAA ATG GCTTCT GAA ATT ACG CAA CAA ATG AAT CAA ATT AAA 336 Phe Asn Lys Met Ala SerGlu Ile Thr Gln Gln Met Asn Gln Ile Lys 100 105 110 TCC GAA CAA CAA GAAAAA ACA GAA CTG ATT CAA AAC TTA GCC CAT GAT 384 Ser Glu Gln Gln Glu LysThr Glu Leu Ile Gln Asn Leu Ala His Asp 115 120 125 TTA AAA ACA CCT TTAGCA AGC ATT ATT TCA TAT TCT GAA GGA CTA CGT 432 Leu Lys Thr Pro Leu AlaSer Ile Ile Ser Tyr Ser Glu Gly Leu Arg 130 135 140 GAT GGT ATA ATC ACTAAG GAT CAT GAG ATT AAA GAG TCA TAC GAC ATA 480 Asp Gly Ile Ile Thr LysAsp His Glu Ile Lys Glu Ser Tyr Asp Ile 145 150 155 160 TTA ATT AAA CAAGCA AAC AGA TTA TCA ACA TTA TTT GAT GAT ATG ACT 528 Leu Ile Lys Gln AlaAsn Arg Leu Ser Thr Leu Phe Asp Asp Met Thr 165 170 175 CAT ATT ATC ACTTTA AAT ACA GGT AAA ACA TAT CCC CCA GAA TTA ATA 576 His Ile Ile Thr LeuAsn Thr Gly Lys Thr Tyr Pro Pro Glu Leu Ile 180 185 190 CAA CTA GAC CAATTA CTT GTA TCA ATA TTG CAA CCA TAT GAG CAA CGT 624 Gln Leu Asp Gln LeuLeu Val Ser Ile Leu Gln Pro Tyr Glu Gln Arg 195 200 205 ATC AAA CAT GAAAAC CGC ACA TTA GAA GTG AAT TTC TGT AAC GAA ATT 672 Ile Lys His Glu AsnArg Thr Leu Glu Val Asn Phe Cys Asn Glu Ile 210 215 220 GAT GCA TTT TATCAA TAT CGA ACG CCA CTT GAG CGT ATT TTA ACA AAC 720 Asp Ala Phe Tyr GlnTyr Arg Thr Pro Leu Glu Arg Ile Leu Thr Asn 225 230 235 240 TTA CTT GATAAT GCG CTA AAA TTT TCA AAT GTT GGT AGT CGC ATT GAT 768 Leu Leu Asp AsnAla Leu Lys Phe Ser Asn Val Gly Ser Arg Ile Asp 245 250 255 ATT AAT ATTAGT GAA AAC GAA GAT CAA GAT ACT ATC GAC ATT GCT ATT 816 Ile Asn Ile SerGlu Asn Glu Asp Gln Asp Thr Ile Asp Ile Ala Ile 260 265 270 AGC GAT GAAGGT ATT GGC ATT ATA CCA GAA CTA CAA GAA CGT ATA TTC 864 Ser Asp Glu GlyIle Gly Ile Ile Pro Glu Leu Gln Glu Arg Ile Phe 275 280 285 GAA CGT ACATTC AGA GTA GAA AAC TCT CGT AAT ACA AAA ACG GGT GGT 912 Glu Arg Thr PheArg Val Glu Asn Ser Arg Asn Thr Lys Thr Gly Gly 290 295 300 TCT GGA TTAGGC TTA TAT ATA GCT AAT GAA CTC GCG CAA CAA AAT AAC 960 Ser Gly Leu GlyLeu Tyr Ile Ala Asn Glu Leu Ala Gln Gln Asn Asn 305 310 315 320 GCA AAAATC AGT GTA AGC AGT GAT ATA GAT GTA GGA ACT ACG ATG ACT 1008 Ala Lys IleSer Val Ser Ser Asp Ile Asp Val Gly Thr Thr Met Thr 325 330 335 GTA ACATTA CAC AAA TTA GAC ATT ACG TCA TAA 1041 Val Thr Leu His Lys Leu Asp IleThr Ser 340 345 346 amino acids amino acid linear protein 68 Arg Ser GlnIle Ile Xaa Gly Val Xaa Ser Ser Ile Leu Leu Thr Ser 1 5 10 15 Thr IleLeu Ala Ile Ala Tyr Ile Leu Met Trp Phe Asn Gly His Met 20 25 30 Thr LeuThr Leu Thr Leu Thr Thr Ile Ile Thr Ser Cys Leu Thr Leu 35 40 45 Leu IleCys Ser Ile Phe Ile Asn Pro Leu Ile Gln Lys Ile Lys Gln 50 55 60 Phe AsnIle Lys Thr Lys Gln Phe Ala Asn Gly Asn Tyr Ala Ser Asn 65 70 75 80 AspLys Thr Phe Asn Ser Pro Lys Glu Ile Tyr Glu Leu Asn Gln Ser 85 90 95 PheAsn Lys Met Ala Ser Glu Ile Thr Gln Gln Met Asn Gln Ile Lys 100 105 110Ser Glu Gln Gln Glu Lys Thr Glu Leu Ile Gln Asn Leu Ala His Asp 115 120125 Leu Lys Thr Pro Leu Ala Ser Ile Ile Ser Tyr Ser Glu Gly Leu Arg 130135 140 Asp Gly Ile Ile Thr Lys Asp His Glu Ile Lys Glu Ser Tyr Asp Ile145 150 155 160 Leu Ile Lys Gln Ala Asn Arg Leu Ser Thr Leu Phe Asp AspMet Thr 165 170 175 His Ile Ile Thr Leu Asn Thr Gly Lys Thr Tyr Pro ProGlu Leu Ile 180 185 190 Gln Leu Asp Gln Leu Leu Val Ser Ile Leu Gln ProTyr Glu Gln Arg 195 200 205 Ile Lys His Glu Asn Arg Thr Leu Glu Val AsnPhe Cys Asn Glu Ile 210 215 220 Asp Ala Phe Tyr Gln Tyr Arg Thr Pro LeuGlu Arg Ile Leu Thr Asn 225 230 235 240 Leu Leu Asp Asn Ala Leu Lys PheSer Asn Val Gly Ser Arg Ile Asp 245 250 255 Ile Asn Ile Ser Glu Asn GluAsp Gln Asp Thr Ile Asp Ile Ala Ile 260 265 270 Ser Asp Glu Gly Ile GlyIle Ile Pro Glu Leu Gln Glu Arg Ile Phe 275 280 285 Glu Arg Thr Phe ArgVal Glu Asn Ser Arg Asn Thr Lys Thr Gly Gly 290 295 300 Ser Gly Leu GlyLeu Tyr Ile Ala Asn Glu Leu Ala Gln Gln Asn Asn 305 310 315 320 Ala LysIle Ser Val Ser Ser Asp Ile Asp Val Gly Thr Thr Met Thr 325 330 335 ValThr Leu His Lys Leu Asp Ile Thr Ser 340 345 392 base pairs nucleic acidsingle linear other nucleic acid /desc = ”DNA (genomic) (p10c52)“ CDS1..390 69 GTG AAC TAT GAA TAT TTC GAA TAT TCG GTT AAT TTA GGT GGT ACCACG 48 Val Asn Tyr Glu Tyr Phe Glu Tyr Ser Val Asn Leu Gly Gly Thr Thr 15 10 15 YGT CAC GTC CTT TAT ATT GAT AAG GAT GCT GGC GCT TTT TTG AAA GGA96 Xaa His Val Leu Tyr Ile Asp Lys Asp Ala Gly Ala Phe Leu Lys Gly 20 2530 GCG TAT AGA ATG GAT ATA TTT TAT AAA AAA ATA AAA GCA AAT GTA ACG 144Ala Tyr Arg Met Asp Ile Phe Tyr Lys Lys Ile Lys Ala Asn Val Thr 35 40 45CCC GAA GTT TTA GCA CAA CTT CAT TCC AAG AAG ATC ATT TTG GAA AGT 192 ProGlu Val Leu Ala Gln Leu His Ser Lys Lys Ile Ile Leu Glu Ser 50 55 60 ACAAAT CAA CAA CAA ACT AAA GGT CGC TAT TCA GTT GTT ATT TTT GAT 240 Thr AsnGln Gln Gln Thr Lys Gly Arg Tyr Ser Val Val Ile Phe Asp 65 70 75 80 ATTTAT GGC ACT TTA ACT TTA GAT AAT GAT GTA TTA TCA GTA AGT ACT 288 Ile TyrGly Thr Leu Thr Leu Asp Asn Asp Val Leu Ser Val Ser Thr 85 90 95 TTA AAAGAA TCG TAT CAA ATC ACT GAA AGA CCG TAC CAT TAT TTA ACG 336 Leu Lys GluSer Tyr Gln Ile Thr Glu Arg Pro Tyr His Tyr Leu Thr 100 105 110 ACT AANATA AAT GAA GAC TAC CAT AAT ATT CCA AGA TGA GGC AAC TTA 384 Thr Xaa IleAsn Glu Asp Tyr His Asn Ile Pro Arg Xaa Gly Asn Leu 115 120 125 AGT CATTA 392 Ser His 130 130 amino acids amino acid linear protein 70 Val AsnTyr Glu Tyr Phe Glu Tyr Ser Val Asn Leu Gly Gly Thr Thr 1 5 10 15 XaaHis Val Leu Tyr Ile Asp Lys Asp Ala Gly Ala Phe Leu Lys Gly 20 25 30 AlaTyr Arg Met Asp Ile Phe Tyr Lys Lys Ile Lys Ala Asn Val Thr 35 40 45 ProGlu Val Leu Ala Gln Leu His Ser Lys Lys Ile Ile Leu Glu Ser 50 55 60 ThrAsn Gln Gln Gln Thr Lys Gly Arg Tyr Ser Val Val Ile Phe Asp 65 70 75 80Ile Tyr Gly Thr Leu Thr Leu Asp Asn Asp Val Leu Ser Val Ser Thr 85 90 95Leu Lys Glu Ser Tyr Gln Ile Thr Glu Arg Pro Tyr His Tyr Leu Thr 100 105110 Thr Xaa Ile Asn Glu Asp Tyr His Asn Ile Pro Arg Xaa Gly Asn Leu 115120 125 Ser His 130 1071 base pairs nucleic acid single linear othernucleic acid /desc = ”DNA (genomic) (p10d9)“ CDS 1..1068 71 ATG AAA ATGAGA ACA ATT GCT AAA ACC AGT TTA GCA CTA GGG CTT TTA 48 Met Lys Met ArgThr Ile Ala Lys Thr Ser Leu Ala Leu Gly Leu Leu 1 5 10 15 ACA ACA GGCGCA ATT ACA GTA ACG ACG CAA TCG GTC AAA GCA GAA AAA 96 Thr Thr Gly AlaIle Thr Val Thr Thr Gln Ser Val Lys Ala Glu Lys 20 25 30 ATA CAA TCA ACTAAA GTT GAC AAA GTA CCA ACG CTT AAA GCA GAG CGA 144 Ile Gln Ser Thr LysVal Asp Lys Val Pro Thr Leu Lys Ala Glu Arg 35 40 45 TTA GCA ATG ATA AACATA ACA GCA GGT GCA AAT TCA GCG ACA ACA CAA 192 Leu Ala Met Ile Asn IleThr Ala Gly Ala Asn Ser Ala Thr Thr Gln 50 55 60 GCA GCT AAC ACA AGA CAAGAA CGC ACG CCT AAA CTC GAA AAG GCA CCA 240 Ala Ala Asn Thr Arg Gln GluArg Thr Pro Lys Leu Glu Lys Ala Pro 65 70 75 80 AAT ACT AAT GAG GAA AAAACC TCA GCT TCC AAA ATA GAA AAA ATA TCA 288 Asn Thr Asn Glu Glu Lys ThrSer Ala Ser Lys Ile Glu Lys Ile Ser 85 90 95 CAA CCT AAA CAA GAA GAG CAGAAA ACG CTT AAT ATA TCA GCA ACG CCA 336 Gln Pro Lys Gln Glu Glu Gln LysThr Leu Asn Ile Ser Ala Thr Pro 100 105 110 GCG CCT AAA CAA GAA CAA TCACAA ACG ACA ACC GAA TCC ACA ACG CCG 384 Ala Pro Lys Gln Glu Gln Ser GlnThr Thr Thr Glu Ser Thr Thr Pro 115 120 125 AAA ACT AAA GTG ACA ACA CCTCCA TCA ACA AAC ACG CCA CAA CCA ATG 432 Lys Thr Lys Val Thr Thr Pro ProSer Thr Asn Thr Pro Gln Pro Met 130 135 140 CAA TCT ACT AAA TCA GAC ACACCA CAA TCT CCA ACC ATA AAA CAA GCA 480 Gln Ser Thr Lys Ser Asp Thr ProGln Ser Pro Thr Ile Lys Gln Ala 145 150 155 160 CAA ACA GAT ATG ACT CCTAAA TAT GAA GAT TTA AGA GCG TAT TAT ACA 528 Gln Thr Asp Met Thr Pro LysTyr Glu Asp Leu Arg Ala Tyr Tyr Thr 165 170 175 AAA CCG AGT TTT GAA TTTGAA AAG CAG TTT GGA TTT ATG CTC AAA CCA 576 Lys Pro Ser Phe Glu Phe GluLys Gln Phe Gly Phe Met Leu Lys Pro 180 185 190 TGG ACG ACG GTT AGG TTTATG AAT GTT ATT CCA AAT AGG TTC ATC TAT 624 Trp Thr Thr Val Arg Phe MetAsn Val Ile Pro Asn Arg Phe Ile Tyr 195 200 205 AAA ATA GCT TTA GTT GGAAAA GAT GAG AAA AAA TAT AAA GAT GGA CCT 672 Lys Ile Ala Leu Val Gly LysAsp Glu Lys Lys Tyr Lys Asp Gly Pro 210 215 220 TAC GAT AAT ATC GAT GTATTT ATC GTT TTA GAA GAC AAT AAA TAT CAA 720 Tyr Asp Asn Ile Asp Val PheIle Val Leu Glu Asp Asn Lys Tyr Gln 225 230 235 240 TTG AAA AAA TAT TCTGTC GGT GGC ATC ACG AAG ACT AAT AGT AAA AAA 768 Leu Lys Lys Tyr Ser ValGly Gly Ile Thr Lys Thr Asn Ser Lys Lys 245 250 255 GTT AAT CAC AAA GTAGAA TTA AGC ATT ACT AAA AAA GAT AAT CAA GGT 816 Val Asn His Lys Val GluLeu Ser Ile Thr Lys Lys Asp Asn Gln Gly 260 265 270 ATG ATT TCA CGC GATGTT TCA GAA TAC ATG ATT ACT AAG GAA GAG ATT 864 Met Ile Ser Arg Asp ValSer Glu Tyr Met Ile Thr Lys Glu Glu Ile 275 280 285 TCC TTG AAA GAG CTTGAT TTT AAA TTG AGA AAA CAA CTT ATT GAA AAA 912 Ser Leu Lys Glu Leu AspPhe Lys Leu Arg Lys Gln Leu Ile Glu Lys 290 295 300 CAT AAT CTT TAC GGTAAC ATG GGT TCA GGA ACA ATC GTT ATT AAA ATG 960 His Asn Leu Tyr Gly AsnMet Gly Ser Gly Thr Ile Val Ile Lys Met 305 310 315 320 AAA AAC GGT GGGAAA TAT ACG TTT GAA TTA CAC AAA AAA CTG CAA GAG 1008 Lys Asn Gly Gly LysTyr Thr Phe Glu Leu His Lys Lys Leu Gln Glu 325 330 335 CAT CGT ATG GCAGAC GTC ATA GAT GGC ACT AAT ATT GAT AAC ATT GAA 1056 His Arg Met Ala AspVal Ile Asp Gly Thr Asn Ile Asp Asn Ile Glu 340 345 350 GTG AAT ATA AAATAA 1071 Val Asn Ile Lys 355 356 amino acids amino acid linear protein72 Met Lys Met Arg Thr Ile Ala Lys Thr Ser Leu Ala Leu Gly Leu Leu 1 510 15 Thr Thr Gly Ala Ile Thr Val Thr Thr Gln Ser Val Lys Ala Glu Lys 2025 30 Ile Gln Ser Thr Lys Val Asp Lys Val Pro Thr Leu Lys Ala Glu Arg 3540 45 Leu Ala Met Ile Asn Ile Thr Ala Gly Ala Asn Ser Ala Thr Thr Gln 5055 60 Ala Ala Asn Thr Arg Gln Glu Arg Thr Pro Lys Leu Glu Lys Ala Pro 6570 75 80 Asn Thr Asn Glu Glu Lys Thr Ser Ala Ser Lys Ile Glu Lys Ile Ser85 90 95 Gln Pro Lys Gln Glu Glu Gln Lys Thr Leu Asn Ile Ser Ala Thr Pro100 105 110 Ala Pro Lys Gln Glu Gln Ser Gln Thr Thr Thr Glu Ser Thr ThrPro 115 120 125 Lys Thr Lys Val Thr Thr Pro Pro Ser Thr Asn Thr Pro GlnPro Met 130 135 140 Gln Ser Thr Lys Ser Asp Thr Pro Gln Ser Pro Thr IleLys Gln Ala 145 150 155 160 Gln Thr Asp Met Thr Pro Lys Tyr Glu Asp LeuArg Ala Tyr Tyr Thr 165 170 175 Lys Pro Ser Phe Glu Phe Glu Lys Gln PheGly Phe Met Leu Lys Pro 180 185 190 Trp Thr Thr Val Arg Phe Met Asn ValIle Pro Asn Arg Phe Ile Tyr 195 200 205 Lys Ile Ala Leu Val Gly Lys AspGlu Lys Lys Tyr Lys Asp Gly Pro 210 215 220 Tyr Asp Asn Ile Asp Val PheIle Val Leu Glu Asp Asn Lys Tyr Gln 225 230 235 240 Leu Lys Lys Tyr SerVal Gly Gly Ile Thr Lys Thr Asn Ser Lys Lys 245 250 255 Val Asn His LysVal Glu Leu Ser Ile Thr Lys Lys Asp Asn Gln Gly 260 265 270 Met Ile SerArg Asp Val Ser Glu Tyr Met Ile Thr Lys Glu Glu Ile 275 280 285 Ser LeuLys Glu Leu Asp Phe Lys Leu Arg Lys Gln Leu Ile Glu Lys 290 295 300 HisAsn Leu Tyr Gly Asn Met Gly Ser Gly Thr Ile Val Ile Lys Met 305 310 315320 Lys Asn Gly Gly Lys Tyr Thr Phe Glu Leu His Lys Lys Leu Gln Glu 325330 335 His Arg Met Ala Asp Val Ile Asp Gly Thr Asn Ile Asp Asn Ile Glu340 345 350 Val Asn Ile Lys 355 1590 base pairs nucleic acid singlelinear other nucleic acid /desc = ”DNA (genomic) (p11c12)“ CDS 1..158773 AGA GCA AAC CCT AGA AAA GGA GGT GTT TGT GTG AAT TTA TTA AGC CTC 48Arg Ala Asn Pro Arg Lys Gly Gly Val Cys Val Asn Leu Leu Ser Leu 1 5 1015 CTA CTC ATT TTG CTG GGG ATC ATT CTA GGA GTT GTT GGA GGG TAT GTT 96Leu Leu Ile Leu Leu Gly Ile Ile Leu Gly Val Val Gly Gly Tyr Val 20 25 30GTT GCC CGA AAT TTG TTG CTT CAA AAG CAA TCA CAA GCT AGA CAA ACT 144 ValAla Arg Asn Leu Leu Leu Gln Lys Gln Ser Gln Ala Arg Gln Thr 35 40 45 GCCGAA GAT ATT GTA AAT CAA GCA CAT AAA GAA GCT GAC AAT ATC AAA 192 Ala GluAsp Ile Val Asn Gln Ala His Lys Glu Ala Asp Asn Ile Lys 50 55 60 AAA GAGAAA TTA CTT GAG GCA AAA GAA GAA AAC CAA ATC CTA AGA GAA 240 Lys Glu LysLeu Leu Glu Ala Lys Glu Glu Asn Gln Ile Leu Arg Glu 65 70 75 80 CAA ACTGAA GCA GAA CTA CGA GAA AGA CGT AGC GAA CTT CAA AGA CAA 288 Gln Thr GluAla Glu Leu Arg Glu Arg Arg Ser Glu Leu Gln Arg Gln 85 90 95 GAA ACC CGACTT CTT CAA AAA GAA GAA AAC TTA GAG CGC AAA TCT GAT 336 Glu Thr Arg LeuLeu Gln Lys Glu Glu Asn Leu Glu Arg Lys Ser Asp 100 105 110 CTA TTA GATAAA AAA GAT GAG ATT TTA GAG CAA AAA GAA TCA AAA ATT 384 Leu Leu Asp LysLys Asp Glu Ile Leu Glu Gln Lys Glu Ser Lys Ile 115 120 125 GAA GAA AAACAA CAA CAA GTA GAT GCA AAA GAG AGT AGT GTT CAA ACG 432 Glu Glu Lys GlnGln Gln Val Asp Ala Lys Glu Ser Ser Val Gln Thr 130 135 140 TTA ATA ATGAAG CAT GAA CAA GAA TTA GAA CGC ATC TCC GGT CTC ACT 480 Leu Ile Met LysHis Glu Gln Glu Leu Glu Arg Ile Ser Gly Leu Thr 145 150 155 160 CAA GAAGAA GCT ATT AAT GAG CAA CTT CAA AGA GTA GAG GAA GAA CTG 528 Gln Glu GluAla Ile Asn Glu Gln Leu Gln Arg Val Glu Glu Glu Leu 165 170 175 TCA CAAGAT ATT GCA GTA CTT GTT AAA GAA AAA GAA AAA GAA GCT AAA 576 Ser Gln AspIle Ala Val Leu Val Lys Glu Lys Glu Lys Glu Ala Lys 180 185 190 GAA AAAGTT GAT AAA ACA GCA AAA GAA TTA TTA GCT ACA GCA GTA CAA 624 Glu Lys ValAsp Lys Thr Ala Lys Glu Leu Leu Ala Thr Ala Val Gln 195 200 205 AGA TTAGCA GCA GAT CAC ACA AGT GAA TCA ACG GTA TCA GTA GTT AAC 672 Arg Leu AlaAla Asp His Thr Ser Glu Ser Thr Val Ser Val Val Asn 210 215 220 TTA CCTAAT GAT GAG ATG AAA GGT CGA ATC ATT GGA CGA GAA GGA CGA 720 Leu Pro AsnAsp Glu Met Lys Gly Arg Ile Ile Gly Arg Glu Gly Arg 225 230 235 240 AACATC CGC ACA CTT GAA ACT TTA ACT GGC ATT GAT TTA ATT ATT GAT 768 Asn IleArg Thr Leu Glu Thr Leu Thr Gly Ile Asp Leu Ile Ile Asp 245 250 255 GACACA CCA GAA GCG GTT ATA TTA TCT GGT TTT GAT CCA ATA AGA AGA 816 Asp ThrPro Glu Ala Val Ile Leu Ser Gly Phe Asp Pro Ile Arg Arg 260 265 270 GAAATT GCT AGA ACA GCA CTT GTT AAC TTA GTA TCT GAT GGA CGT ATT 864 Glu IleAla Arg Thr Ala Leu Val Asn Leu Val Ser Asp Gly Arg Ile 275 280 285 CATCCA GGT AGA ATT GAA GAT ATG GTC GAA AAA GCT AGA AAA GAA GTA 912 His ProGly Arg Ile Glu Asp Met Val Glu Lys Ala Arg Lys Glu Val 290 295 300 GACGAT ATT ATT AGA GAA GCA GGT GAA CAA GCT ACA TTT GAA GTG AAC 960 Asp AspIle Ile Arg Glu Ala Gly Glu Gln Ala Thr Phe Glu Val Asn 305 310 315 320GCA CAT AAT ATG CAT CCT GAC TTA GTA AAA ATT GTA GGG CGT TTA AAC 1008 AlaHis Asn Met His Pro Asp Leu Val Lys Ile Val Gly Arg Leu Asn 325 330 335TAT CGT ACG AGT TAC GGT CAA AAT GTA CTT AAA CAT TCA ATT GAA GTT 1056 TyrArg Thr Ser Tyr Gly Gln Asn Val Leu Lys His Ser Ile Glu Val 340 345 350GCG CAT CTT GCT AGT ATG TTA GCT GCT GAG CTA GGC GAA GAT GAG ACA 1104 AlaHis Leu Ala Ser Met Leu Ala Ala Glu Leu Gly Glu Asp Glu Thr 355 360 365TTA GCG AAA CGA GCT GGA CTT TTA CAT GAT GTT GGT AAA GCA ATT GAT 1152 LeuAla Lys Arg Ala Gly Leu Leu His Asp Val Gly Lys Ala Ile Asp 370 375 380CAT GAA GTA GAA GGT AGT CAT GTT GAA ATC GGT GTA GAA TTA GCG AAA 1200 HisGlu Val Glu Gly Ser His Val Glu Ile Gly Val Glu Leu Ala Lys 385 390 395400 AAA TAT GGT GAA AAT GAA ACA GTT ATT AAT GCA ATC CAT TCT CAT CAT 1248Lys Tyr Gly Glu Asn Glu Thr Val Ile Asn Ala Ile His Ser His His 405 410415 GGT GAT GTT GAA CCT ACA TCT ATT ATA TCT ATC CTT GTT GCT GCT GCA 1296Gly Asp Val Glu Pro Thr Ser Ile Ile Ser Ile Leu Val Ala Ala Ala 420 425430 GAT GCA TTG TCT GCG GCT CGT CCA GGT GCA AGA AAA GAA ACA TTA GAG 1344Asp Ala Leu Ser Ala Ala Arg Pro Gly Ala Arg Lys Glu Thr Leu Glu 435 440445 AAT TAT ATT CGT CGA TTA GAA CGT TTA GAA ACG TTA TCA GAA AGT TAT 1392Asn Tyr Ile Arg Arg Leu Glu Arg Leu Glu Thr Leu Ser Glu Ser Tyr 450 455460 GAT GGT GTA GAA AAA GCA TTT GCG ATT CAG GCA GGT AGA GAA ATC CGA 1440Asp Gly Val Glu Lys Ala Phe Ala Ile Gln Ala Gly Arg Glu Ile Arg 465 470475 480 GTG ATT GTA TCT CCT GAA GAA ATT GAT GAT TTA AAA TCT TAT CGA TTG1488 Val Ile Val Ser Pro Glu Glu Ile Asp Asp Leu Lys Ser Tyr Arg Leu 485490 495 GCT AGA GAT ATT AAA AAT CAG ATT GAA GAT GAA TTA CAA TAT CCT GGT1536 Ala Arg Asp Ile Lys Asn Gln Ile Glu Asp Glu Leu Gln Tyr Pro Gly 500505 510 CAT ATC AAG GTG ACA GTT GTT CGA GAG ACT AGA GCA GTA GAA TAT GCG1584 His Ile Lys Val Thr Val Val Arg Glu Thr Arg Ala Val Glu Tyr Ala 515520 525 AAA TAA 1590 Lys 529 amino acids amino acid linear protein 74Arg Ala Asn Pro Arg Lys Gly Gly Val Cys Val Asn Leu Leu Ser Leu 1 5 1015 Leu Leu Ile Leu Leu Gly Ile Ile Leu Gly Val Val Gly Gly Tyr Val 20 2530 Val Ala Arg Asn Leu Leu Leu Gln Lys Gln Ser Gln Ala Arg Gln Thr 35 4045 Ala Glu Asp Ile Val Asn Gln Ala His Lys Glu Ala Asp Asn Ile Lys 50 5560 Lys Glu Lys Leu Leu Glu Ala Lys Glu Glu Asn Gln Ile Leu Arg Glu 65 7075 80 Gln Thr Glu Ala Glu Leu Arg Glu Arg Arg Ser Glu Leu Gln Arg Gln 8590 95 Glu Thr Arg Leu Leu Gln Lys Glu Glu Asn Leu Glu Arg Lys Ser Asp100 105 110 Leu Leu Asp Lys Lys Asp Glu Ile Leu Glu Gln Lys Glu Ser LysIle 115 120 125 Glu Glu Lys Gln Gln Gln Val Asp Ala Lys Glu Ser Ser ValGln Thr 130 135 140 Leu Ile Met Lys His Glu Gln Glu Leu Glu Arg Ile SerGly Leu Thr 145 150 155 160 Gln Glu Glu Ala Ile Asn Glu Gln Leu Gln ArgVal Glu Glu Glu Leu 165 170 175 Ser Gln Asp Ile Ala Val Leu Val Lys GluLys Glu Lys Glu Ala Lys 180 185 190 Glu Lys Val Asp Lys Thr Ala Lys GluLeu Leu Ala Thr Ala Val Gln 195 200 205 Arg Leu Ala Ala Asp His Thr SerGlu Ser Thr Val Ser Val Val Asn 210 215 220 Leu Pro Asn Asp Glu Met LysGly Arg Ile Ile Gly Arg Glu Gly Arg 225 230 235 240 Asn Ile Arg Thr LeuGlu Thr Leu Thr Gly Ile Asp Leu Ile Ile Asp 245 250 255 Asp Thr Pro GluAla Val Ile Leu Ser Gly Phe Asp Pro Ile Arg Arg 260 265 270 Glu Ile AlaArg Thr Ala Leu Val Asn Leu Val Ser Asp Gly Arg Ile 275 280 285 His ProGly Arg Ile Glu Asp Met Val Glu Lys Ala Arg Lys Glu Val 290 295 300 AspAsp Ile Ile Arg Glu Ala Gly Glu Gln Ala Thr Phe Glu Val Asn 305 310 315320 Ala His Asn Met His Pro Asp Leu Val Lys Ile Val Gly Arg Leu Asn 325330 335 Tyr Arg Thr Ser Tyr Gly Gln Asn Val Leu Lys His Ser Ile Glu Val340 345 350 Ala His Leu Ala Ser Met Leu Ala Ala Glu Leu Gly Glu Asp GluThr 355 360 365 Leu Ala Lys Arg Ala Gly Leu Leu His Asp Val Gly Lys AlaIle Asp 370 375 380 His Glu Val Glu Gly Ser His Val Glu Ile Gly Val GluLeu Ala Lys 385 390 395 400 Lys Tyr Gly Glu Asn Glu Thr Val Ile Asn AlaIle His Ser His His 405 410 415 Gly Asp Val Glu Pro Thr Ser Ile Ile SerIle Leu Val Ala Ala Ala 420 425 430 Asp Ala Leu Ser Ala Ala Arg Pro GlyAla Arg Lys Glu Thr Leu Glu 435 440 445 Asn Tyr Ile Arg Arg Leu Glu ArgLeu Glu Thr Leu Ser Glu Ser Tyr 450 455 460 Asp Gly Val Glu Lys Ala PheAla Ile Gln Ala Gly Arg Glu Ile Arg 465 470 475 480 Val Ile Val Ser ProGlu Glu Ile Asp Asp Leu Lys Ser Tyr Arg Leu 485 490 495 Ala Arg Asp IleLys Asn Gln Ile Glu Asp Glu Leu Gln Tyr Pro Gly 500 505 510 His Ile LysVal Thr Val Val Arg Glu Thr Arg Ala Val Glu Tyr Ala 515 520 525 Lys 799base pairs nucleic acid single linear other nucleic acid /desc = ”DNA(genomic) (p11c66)“ CDS 1..384 75 TCA TGG TAT TAT ATT GTT TGG AGT GATTTG ATG AGA TTT GTC TTT GAT 48 Ser Trp Tyr Tyr Ile Val Trp Ser Asp LeuMet Arg Phe Val Phe Asp 1 5 10 15 ATT GAT GGT ACG CTT TGT TTT GAC GGCCGA TTA ATT GAC CAG ACT ATT 96 Ile Asp Gly Thr Leu Cys Phe Asp Gly ArgLeu Ile Asp Gln Thr Ile 20 25 30 ATT GAT ACA TTG TTA CAA TTA CAA CAT GATGGT CAT GAA CTT ATA TTT 144 Ile Asp Thr Leu Leu Gln Leu Gln His Asp GlyHis Glu Leu Ile Phe 35 40 45 GCA TCA GCA CGT CCG ATT CGT GAT TTG TTG CCAGTT TTA CCA TCA GTA 192 Ala Ser Ala Arg Pro Ile Arg Asp Leu Leu Pro ValLeu Pro Ser Val 50 55 60 TTT CAT CAG CAC ACA TTA ATT GGC GCA AAT GGT GCTATG ATT TCA CAG 240 Phe His Gln His Thr Leu Ile Gly Ala Asn Gly Ala MetIle Ser Gln 65 70 75 80 CAA TCA AAG ATT TCT GTT ATC AAA CCA ATT CAT ACTGAT ACA TAT CAT 288 Gln Ser Lys Ile Ser Val Ile Lys Pro Ile His Thr AspThr Tyr His 85 90 95 CAT ATC TTC AAA ATA ATT CAA AAG TAT GAG TTA GAT TATATT ATT GAT 336 His Ile Phe Lys Ile Ile Gln Lys Tyr Glu Leu Asp Tyr IleIle Asp 100 105 110 GAT GAT TGG AAT TAT GCT GCA CAA CTT GAC GCT GNA GAACGC GAT TTT 384 Asp Asp Trp Asn Tyr Ala Ala Gln Leu Asp Ala Xaa Glu ArgAsp Phe 115 120 125 TGAGCGTTTA GATCCACATA AGCTGGCCAG TTGTATTGATGTTGCAAATA TCGACACGCC 444 AATCAAGAKT ATTTTATTAA ATATAGACCC GGCACAAATTACAACTATAT TAGACGAGCT 504 AGATAAATAC CATCAAGAAT TGGAAATGAT TCACCATTCAAATGAGTATA ACATTGATAT 564 AACAGCGCAA AATATTAACA AATATACTGC ATTACAATATATATTTGATG CAGATGTTAA 624 ATATATAGCA TTTGGTAATG ACCACAATGA TATTGTCATGTTACAACATG CTAGTAGTGG 684 CTATATTATA GGACCATCAG AAGCATACAC ACACGCAATATTGAAACTTG ATAAAATCAA 744 ACACATCAAT AATAATGCAC AAGCTATTTG CAAAGTCTTAAAATCATATA AATAA 799 128 amino acids amino acid linear protein 76 SerTrp Tyr Tyr Ile Val Trp Ser Asp Leu Met Arg Phe Val Phe Asp 1 5 10 15Ile Asp Gly Thr Leu Cys Phe Asp Gly Arg Leu Ile Asp Gln Thr Ile 20 25 30Ile Asp Thr Leu Leu Gln Leu Gln His Asp Gly His Glu Leu Ile Phe 35 40 45Ala Ser Ala Arg Pro Ile Arg Asp Leu Leu Pro Val Leu Pro Ser Val 50 55 60Phe His Gln His Thr Leu Ile Gly Ala Asn Gly Ala Met Ile Ser Gln 65 70 7580 Gln Ser Lys Ile Ser Val Ile Lys Pro Ile His Thr Asp Thr Tyr His 85 9095 His Ile Phe Lys Ile Ile Gln Lys Tyr Glu Leu Asp Tyr Ile Ile Asp 100105 110 Asp Asp Trp Asn Tyr Ala Ala Gln Leu Asp Ala Xaa Glu Arg Asp Phe115 120 125 799 base pairs nucleic acid single linear other nucleic acid/desc = ”DNA (genomic) (p11c66)“ CDS 335..796 77 TCATGGTATT ATATTGTTTGGAGTGATTTG ATGAGATTTG TCTTTGATAT TGATGGTACG 60 CTTTGTTTTG ACGGCCGATTAATTGACCAG ACTATTATTG ATACATTGTT ACAATTACAA 120 CATGATGGTC ATGAACTTATATTTGCATCA GCACGTCCGA TTCGTGATTT GTTGCCAGTT 180 TTACCATCAG TATTTCATCAGCACACATTA ATTGGCGCAA ATGGTGCTAT GATTTCACAG 240 CAATCAAAGA TTTCTGTTATCAAACCAATT CATACTGATA CATATCATCA TATCTTCAAA 300 ATAATTCAAA AGTATGAGTTAGATTATATT ATTG ATG ATG ATT GGA ATT ATG 352 Met Met Ile Gly Ile Met 1 5CTG CAC AAC TTG ACG CTG NAG AAC GCG ATT TTT GAG CGT TTA GAT CCA 400 LeuHis Asn Leu Thr Leu Xaa Asn Ala Ile Phe Glu Arg Leu Asp Pro 10 15 20 CATAAG CTG GCC AGT TGT ATT GAT GTT GCA AAT ATC GAC ACG CCA ATC 448 His LysLeu Ala Ser Cys Ile Asp Val Ala Asn Ile Asp Thr Pro Ile 25 30 35 AAG AKTATT TTA TTA AAT ATA GAC CCG GCA CAA ATT ACA ACT ATA TTA 496 Lys Xaa IleLeu Leu Asn Ile Asp Pro Ala Gln Ile Thr Thr Ile Leu 40 45 50 GAC GAG CTAGAT AAA TAC CAT CAA GAA TTG GAA ATG ATT CAC CAT TCA 544 Asp Glu Leu AspLys Tyr His Gln Glu Leu Glu Met Ile His His Ser 55 60 65 70 AAT GAG TATAAC ATT GAT ATA ACA GCG CAA AAT ATT AAC AAA TAT ACT 592 Asn Glu Tyr AsnIle Asp Ile Thr Ala Gln Asn Ile Asn Lys Tyr Thr 75 80 85 GCA TTA CAA TATATA TTT GAT GCA GAT GTT AAA TAT ATA GCA TTT GGT 640 Ala Leu Gln Tyr IlePhe Asp Ala Asp Val Lys Tyr Ile Ala Phe Gly 90 95 100 AAT GAC CAC AATGAT ATT GTC ATG TTA CAA CAT GCT AGT AGT GGC TAT 688 Asn Asp His Asn AspIle Val Met Leu Gln His Ala Ser Ser Gly Tyr 105 110 115 ATT ATA GGA CCATCA GAA GCA TAC ACA CAC GCA ATA TTG AAA CTT GAT 736 Ile Ile Gly Pro SerGlu Ala Tyr Thr His Ala Ile Leu Lys Leu Asp 120 125 130 AAA ATC AAA CACATC AAT AAT AAT GCA CAA GCT ATT TGC AAA GTC TTA 784 Lys Ile Lys His IleAsn Asn Asn Ala Gln Ala Ile Cys Lys Val Leu 135 140 145 150 AAA TCA TATAAA TAA 799 Lys Ser Tyr Lys 154 amino acids amino acid linear protein 78Met Met Ile Gly Ile Met Leu His Asn Leu Thr Leu Xaa Asn Ala Ile 1 5 1015 Phe Glu Arg Leu Asp Pro His Lys Leu Ala Ser Cys Ile Asp Val Ala 20 2530 Asn Ile Asp Thr Pro Ile Lys Xaa Ile Leu Leu Asn Ile Asp Pro Ala 35 4045 Gln Ile Thr Thr Ile Leu Asp Glu Leu Asp Lys Tyr His Gln Glu Leu 50 5560 Glu Met Ile His His Ser Asn Glu Tyr Asn Ile Asp Ile Thr Ala Gln 65 7075 80 Asn Ile Asn Lys Tyr Thr Ala Leu Gln Tyr Ile Phe Asp Ala Asp Val 8590 95 Lys Tyr Ile Ala Phe Gly Asn Asp His Asn Asp Ile Val Met Leu Gln100 105 110 His Ala Ser Ser Gly Tyr Ile Ile Gly Pro Ser Glu Ala Tyr ThrHis 115 120 125 Ala Ile Leu Lys Leu Asp Lys Ile Lys His Ile Asn Asn AsnAla Gln 130 135 140 Ala Ile Cys Lys Val Leu Lys Ser Tyr Lys 145 150 1236base pairs nucleic acid single linear other nucleic acid /desc = ”DNA(genomic) p5c34“ CDS 1..1233 79 AGC TCT ACG TTC AAC ATT TTT CCA TTG GGGATT CCA TGC TTG GGC TGT 48 Ser Ser Thr Phe Asn Ile Phe Pro Leu Gly IlePro Cys Leu Gly Cys 1 5 10 15 TTA TGG TGT TGT TGC GTT ACG TTG GCA TATTCG CAA TTC CGT AAA GGT 96 Leu Trp Cys Cys Cys Val Thr Leu Ala Tyr SerGln Phe Arg Lys Gly 20 25 30 GAA CCA GGT TTA TTA TCT AGA ACT TTA CGT CCTCTT TTA GGT GAT AAA 144 Glu Pro Gly Leu Leu Ser Arg Thr Leu Arg Pro LeuLeu Gly Asp Lys 35 40 45 GTA GAA GGT CCT ATT GGG ATT TTT ATT GAC GTT TTATCT GTA TTT GCG 192 Val Glu Gly Pro Ile Gly Ile Phe Ile Asp Val Leu SerVal Phe Ala 50 55 60 ACA ATC GTT GGG GTA GCC GTT TCG TTA GGT ATG GGT GCTCTA CAA ATT 240 Thr Ile Val Gly Val Ala Val Ser Leu Gly Met Gly Ala LeuGln Ile 65 70 75 80 AAT GGT GGT TTA CAT TAC TTG TTC AAT GTT CCA AAC AATACG TTT GTA 288 Asn Gly Gly Leu His Tyr Leu Phe Asn Val Pro Asn Asn ThrPhe Val 85 90 95 CAA GCG ATT ATC ATC ATT GTT GTT ACT ATC TTA TTT ATA GCAAGT GCA 336 Gln Ala Ile Ile Ile Ile Val Val Thr Ile Leu Phe Ile Ala SerAla 100 105 110 TGG TCT GGA TTA AGT AAA GGT ATT CAA TAC TTA AGT AAC TTGAAC ATT 384 Trp Ser Gly Leu Ser Lys Gly Ile Gln Tyr Leu Ser Asn Leu AsnIle 115 120 125 GGT TTA GGT ACT ATT TTA ATG GTA GCT GCT TTA ATT GTT GGACCA ACT 432 Gly Leu Gly Thr Ile Leu Met Val Ala Ala Leu Ile Val Gly ProThr 130 135 140 GTT CTT ATT TTA AAT ATG TTA ACT AGC TCT ACG GGT AGT TTACTA AAC 480 Val Leu Ile Leu Asn Met Leu Thr Ser Ser Thr Gly Ser Leu LeuAsn 145 150 155 160 ACA TTC TTG TTT AAT AGT TTT GAT ACA GCA GCT TTA AATCCT CAA AAA 528 Thr Phe Leu Phe Asn Ser Phe Asp Thr Ala Ala Leu Asn ProGln Lys 165 170 175 CGT GAA TGG ATG TCT TCA TGG ACA CTT TAT TAC TGG GGTTGG TGG TTA 576 Arg Glu Trp Met Ser Ser Trp Thr Leu Tyr Tyr Trp Gly TrpTrp Leu 180 185 190 AGT TGG AGT CCA TTC GTT GGA GTG TTT ATT GCA CGA GTTTCA AAA GGA 624 Ser Trp Ser Pro Phe Val Gly Val Phe Ile Ala Arg Val SerLys Gly 195 200 205 CGT TCA ATT AGA GAG TTC ATT TCT GGT GTC TTG CTA GTTCCA GCA ATT 672 Arg Ser Ile Arg Glu Phe Ile Ser Gly Val Leu Leu Val ProAla Ile 210 215 220 GTT AGT TTT GTT TGG TTT AGT GTC TTT GGT GTA TTA GGCATC GAG ACA 720 Val Ser Phe Val Trp Phe Ser Val Phe Gly Val Leu Gly IleGlu Thr 225 230 235 240 GGT AAG AAA CAC AAA GAA ATT TTT GAT ATG ACT CCTGAA ACA CAG CTA 768 Gly Lys Lys His Lys Glu Ile Phe Asp Met Thr Pro GluThr Gln Leu 245 250 255 TTT GGA GTG TTT AAT CAT GTG CCA TTT GGC ATT GTTTTA TCG TTG ATT 816 Phe Gly Val Phe Asn His Val Pro Phe Gly Ile Val LeuSer Leu Ile 260 265 270 GCA TTA TTA TTA ATT GCA TCA TTC TTT ATT ACA TCTGCT GAC TCA GCA 864 Ala Leu Leu Leu Ile Ala Ser Phe Phe Ile Thr Ser AlaAsp Ser Ala 275 280 285 ACA TTT GTA TTA GGA ATG CAA ACA ACA TTT GGT TCATTA AAT CCA TCT 912 Thr Phe Val Leu Gly Met Gln Thr Thr Phe Gly Ser LeuAsn Pro Ser 290 295 300 AGT ATG GTA AAA GTT GTT TGG GGA ATT TCA CAG GCCTTA ATA GCA TTT 960 Ser Met Val Lys Val Val Trp Gly Ile Ser Gln Ala LeuIle Ala Phe 305 310 315 320 GTA CTT TTA TTA GCT GGT GGC GGT AAC GGC GCTGAA GCT TTA AAT GCG 1008 Val Leu Leu Leu Ala Gly Gly Gly Asn Gly Ala GluAla Leu Asn Ala 325 330 335 ATT CAA AGT GCT GCA ATT ATA AGT GCA TTC CCATTC TCC TTT GTC GTC 1056 Ile Gln Ser Ala Ala Ile Ile Ser Ala Phe Pro PheSer Phe Val Val 340 345 350 ATA CTC ATG ATG GTA AGT TTC TAC AAG GAT GCGAAC CAG GAA CGT AAA 1104 Ile Leu Met Met Val Ser Phe Tyr Lys Asp Ala AsnGln Glu Arg Lys 355 360 365 TTC CTA GGT TTA ACA TTG ACT CCG AAT AAA CATCGC TTA CAA GAA TAT 1152 Phe Leu Gly Leu Thr Leu Thr Pro Asn Lys His ArgLeu Gln Glu Tyr 370 375 380 ATC AAG AGT CAA CAA GAA GAT TAT GAA TCT GACATT CTT GAA AAG CGT 1200 Ile Lys Ser Gln Gln Glu Asp Tyr Glu Ser Asp IleLeu Glu Lys Arg 385 390 395 400 CAG TCA CGT AGA AAT ATA GAG AAA AAA GATAAC TAA 1236 Gln Ser Arg Arg Asn Ile Glu Lys Lys Asp Asn 405 410 411amino acids amino acid linear protein 80 Ser Ser Thr Phe Asn Ile Phe ProLeu Gly Ile Pro Cys Leu Gly Cys 1 5 10 15 Leu Trp Cys Cys Cys Val ThrLeu Ala Tyr Ser Gln Phe Arg Lys Gly 20 25 30 Glu Pro Gly Leu Leu Ser ArgThr Leu Arg Pro Leu Leu Gly Asp Lys 35 40 45 Val Glu Gly Pro Ile Gly IlePhe Ile Asp Val Leu Ser Val Phe Ala 50 55 60 Thr Ile Val Gly Val Ala ValSer Leu Gly Met Gly Ala Leu Gln Ile 65 70 75 80 Asn Gly Gly Leu His TyrLeu Phe Asn Val Pro Asn Asn Thr Phe Val 85 90 95 Gln Ala Ile Ile Ile IleVal Val Thr Ile Leu Phe Ile Ala Ser Ala 100 105 110 Trp Ser Gly Leu SerLys Gly Ile Gln Tyr Leu Ser Asn Leu Asn Ile 115 120 125 Gly Leu Gly ThrIle Leu Met Val Ala Ala Leu Ile Val Gly Pro Thr 130 135 140 Val Leu IleLeu Asn Met Leu Thr Ser Ser Thr Gly Ser Leu Leu Asn 145 150 155 160 ThrPhe Leu Phe Asn Ser Phe Asp Thr Ala Ala Leu Asn Pro Gln Lys 165 170 175Arg Glu Trp Met Ser Ser Trp Thr Leu Tyr Tyr Trp Gly Trp Trp Leu 180 185190 Ser Trp Ser Pro Phe Val Gly Val Phe Ile Ala Arg Val Ser Lys Gly 195200 205 Arg Ser Ile Arg Glu Phe Ile Ser Gly Val Leu Leu Val Pro Ala Ile210 215 220 Val Ser Phe Val Trp Phe Ser Val Phe Gly Val Leu Gly Ile GluThr 225 230 235 240 Gly Lys Lys His Lys Glu Ile Phe Asp Met Thr Pro GluThr Gln Leu 245 250 255 Phe Gly Val Phe Asn His Val Pro Phe Gly Ile ValLeu Ser Leu Ile 260 265 270 Ala Leu Leu Leu Ile Ala Ser Phe Phe Ile ThrSer Ala Asp Ser Ala 275 280 285 Thr Phe Val Leu Gly Met Gln Thr Thr PheGly Ser Leu Asn Pro Ser 290 295 300 Ser Met Val Lys Val Val Trp Gly IleSer Gln Ala Leu Ile Ala Phe 305 310 315 320 Val Leu Leu Leu Ala Gly GlyGly Asn Gly Ala Glu Ala Leu Asn Ala 325 330 335 Ile Gln Ser Ala Ala IleIle Ser Ala Phe Pro Phe Ser Phe Val Val 340 345 350 Ile Leu Met Met ValSer Phe Tyr Lys Asp Ala Asn Gln Glu Arg Lys 355 360 365 Phe Leu Gly LeuThr Leu Thr Pro Asn Lys His Arg Leu Gln Glu Tyr 370 375 380 Ile Lys SerGln Gln Glu Asp Tyr Glu Ser Asp Ile Leu Glu Lys Arg 385 390 395 400 GlnSer Arg Arg Asn Ile Glu Lys Lys Asp Asn 405 410 477 base pairs nucleicacid single linear other nucleic acid /desc = ”DNA (genomic) (p10c18)“81 ATTATAGAGT TTTTCCGCTG TTTCTAAAGT TAAATTATTC ACTTTTCTTT TCCCGCTTCT 60TAATTCACTT ATACCACCAT AAGAAACATC AGTGTCTTGA TTAATTCTAT AATTAGATAT 120TGATYTATCA TTTAGTARTT TTTCTATTGT ATTATNAATT TCTTTAARCT GGTTCWTAAT 180TTTNGTCNAA ATGAAAGAAT AATTTATTTT NTCTCTANGT TATATTAATC AATNAANTAA 240TATTANAGTT GCAANTTAAG NATAGAGAGT TNATTTTTTT CCTTTANATT CCTCCTTGGT 300CNCTNAANAT TANCCANCCT NCCCCTTATT TTTAAATTTG GTGGGTNNAA AGGGGTTNTT 360CTNGGCCCCN TTTTTNCCCT NTTTTTTTTN ANAAAGNANC CGNAATTCTG GATCCTAACT 420TTTANTNTTN NCTTGGGAAC TCTAATTCAA GGGNCCTCNG CTCGNCNAGC NCTAATT 477 1212base pairs nucleic acid single linear other nucleic acid /desc = ”DNA(genomic) (p12c3)“ CDS 1..1209 82 GAA AGA AAG CGT TTT CAC ATA ACA AAGGGG GAG TTT CAA ATG AAA GTC 48 Glu Arg Lys Arg Phe His Ile Thr Lys GlyGlu Phe Gln Met Lys Val 1 5 10 15 GAA GTT TAT AAA GGA GCG CAA GGT AAACAT AAC CTT AAA GAT TAT GAA 96 Glu Val Tyr Lys Gly Ala Gln Gly Lys HisAsn Leu Lys Asp Tyr Glu 20 25 30 GAA ACA TAT AAT ACT TTT GAT TGG AAA GACGTA GAA CAA GCA TTT TCT 144 Glu Thr Tyr Asn Thr Phe Asp Trp Lys Asp ValGlu Gln Ala Phe Ser 35 40 45 TGG AGT GAA ACT GGA AAA ATG AAC ATG GCA TATGAA TGC ATA GAT CGC 192 Trp Ser Glu Thr Gly Lys Met Asn Met Ala Tyr GluCys Ile Asp Arg 50 55 60 CAT GTA GAT CAA GGA TTA GGG GAT AAA ATA GCG TTAAAT TAC AAA GAT 240 His Val Asp Gln Gly Leu Gly Asp Lys Ile Ala Leu AsnTyr Lys Asp 65 70 75 80 GAG CAC AGA AAA GAA TCG TAT ACT TAT AAA GAT ATGCAA CGG TTA TCT 288 Glu His Arg Lys Glu Ser Tyr Thr Tyr Lys Asp Met GlnArg Leu Ser 85 90 95 AAT AAA GCA GCG AAT GTT TTG TCT GAA CAT GCA GAA GTTGAC AAA GGT 336 Asn Lys Ala Ala Asn Val Leu Ser Glu His Ala Glu Val AspLys Gly 100 105 110 GAC AGA GTA TTT ATA TTT ATG TCG CGT ACA CCT GAA CTATAT TTT GCG 384 Asp Arg Val Phe Ile Phe Met Ser Arg Thr Pro Glu Leu TyrPhe Ala 115 120 125 TTG TTA GGT GTT TTA AAA ATT GGT GCA ATT GTT GGG CCGTTA TTT GAA 432 Leu Leu Gly Val Leu Lys Ile Gly Ala Ile Val Gly Pro LeuPhe Glu 130 135 140 GCA TTT ATG GAA AAG GCA GTT GCG GAT AGA TTA GAG AACAGT GAA GCT 480 Ala Phe Met Glu Lys Ala Val Ala Asp Arg Leu Glu Asn SerGlu Ala 145 150 155 160 AAA GTG TTA ATT ACT AAT AAG GCA TTG TTA CCT CGAGTA CCT GTA GAT 528 Lys Val Leu Ile Thr Asn Lys Ala Leu Leu Pro Arg ValPro Val Asp 165 170 175 AAA TTA CCA AAC TTG AAA AAA ATT GTT GTC GTA GATGAG GAT GTA GAA 576 Lys Leu Pro Asn Leu Lys Lys Ile Val Val Val Asp GluAsp Val Glu 180 185 190 GAC AAT TAC ATA GAC TTC ATT AGT TTG ATG GAA ACTGCT AGC GAT GAA 624 Asp Asn Tyr Ile Asp Phe Ile Ser Leu Met Glu Thr AlaSer Asp Glu 195 200 205 TTT GAC ATT GAA TGG TTA AAG TCG GAT GAT GGT TTGATT TTA CAT TAT 672 Phe Asp Ile Glu Trp Leu Lys Ser Asp Asp Gly Leu IleLeu His Tyr 210 215 220 ACA TCA GGT TCT ACT GGG CAA CCT AAA GGT GTA TTGCAT GTT CAA CAA 720 Thr Ser Gly Ser Thr Gly Gln Pro Lys Gly Val Leu HisVal Gln Gln 225 230 235 240 GCA ATG TTA GTG CAC TAT ATT TCT GGR AAA TATGTA TTA GAT TTA CAA 768 Ala Met Leu Val His Tyr Ile Ser Gly Lys Tyr ValLeu Asp Leu Gln 245 250 255 GAA GAT GAT GTT TAT TGG TGT ACA GCA GAT CCAGGT TGG GTT ACA GGA 816 Glu Asp Asp Val Tyr Trp Cys Thr Ala Asp Pro GlyTrp Val Thr Gly 260 265 270 ACA TCT TAT GGT ATT TTT GCA CCA TGG TTA AATGGC GCT ACA AAT TGT 864 Thr Ser Tyr Gly Ile Phe Ala Pro Trp Leu Asn GlyAla Thr Asn Cys 275 280 285 ATA GCT GGT GGT CGC TTT TCG CCA GAA CAG TGGTAT AGT ATG ATT GAA 912 Ile Ala Gly Gly Arg Phe Ser Pro Glu Gln Trp TyrSer Met Ile Glu 290 295 300 GAT TTT AAA GTG ACG ATT TGG TAT ACG GCA CCAACA GCT TTA AGA ATG 960 Asp Phe Lys Val Thr Ile Trp Tyr Thr Ala Pro ThrAla Leu Arg Met 305 310 315 320 TTA ATG AGT GCT GGT GAC GAT ATT GTT GAGAAA TAT GAC TTG TCA TCG 1008 Leu Met Ser Ala Gly Asp Asp Ile Val Glu LysTyr Asp Leu Ser Ser 325 330 335 TTA CGT TCG ATT CTA TCA GTA GGT GAG CCTTTA AAT CCT GAA GTT ATA 1056 Leu Arg Ser Ile Leu Ser Val Gly Glu Pro LeuAsn Pro Glu Val Ile 340 345 350 AAA TGG GCG AAA AAA GTA TAC GGT TTA ACGGTG TTA GAT ACT TGG TGG 1104 Lys Trp Ala Lys Lys Val Tyr Gly Leu Thr ValLeu Asp Thr Trp Trp 355 360 365 ATG ACA GAA ACA GGT GGA CAT ATG ATT GTTAAC TAT CCA ACG ATG GAC 1152 Met Thr Glu Thr Gly Gly His Met Ile Val AsnTyr Pro Thr Met Asp 370 375 380 GTN CAA GCT TGG CTC MAT GGG CAA ACC ATTACC TGG TAT TCA AGC TGC 1200 Val Gln Ala Trp Leu Xaa Gly Gln Thr Ile ThrTrp Tyr Ser Ser Cys 385 390 395 400 AAT TAT CGA TGA 1212 Asn Tyr Arg 403amino acids amino acid linear protein 83 Glu Arg Lys Arg Phe His Ile ThrLys Gly Glu Phe Gln Met Lys Val 1 5 10 15 Glu Val Tyr Lys Gly Ala GlnGly Lys His Asn Leu Lys Asp Tyr Glu 20 25 30 Glu Thr Tyr Asn Thr Phe AspTrp Lys Asp Val Glu Gln Ala Phe Ser 35 40 45 Trp Ser Glu Thr Gly Lys MetAsn Met Ala Tyr Glu Cys Ile Asp Arg 50 55 60 His Val Asp Gln Gly Leu GlyAsp Lys Ile Ala Leu Asn Tyr Lys Asp 65 70 75 80 Glu His Arg Lys Glu SerTyr Thr Tyr Lys Asp Met Gln Arg Leu Ser 85 90 95 Asn Lys Ala Ala Asn ValLeu Ser Glu His Ala Glu Val Asp Lys Gly 100 105 110 Asp Arg Val Phe IlePhe Met Ser Arg Thr Pro Glu Leu Tyr Phe Ala 115 120 125 Leu Leu Gly ValLeu Lys Ile Gly Ala Ile Val Gly Pro Leu Phe Glu 130 135 140 Ala Phe MetGlu Lys Ala Val Ala Asp Arg Leu Glu Asn Ser Glu Ala 145 150 155 160 LysVal Leu Ile Thr Asn Lys Ala Leu Leu Pro Arg Val Pro Val Asp 165 170 175Lys Leu Pro Asn Leu Lys Lys Ile Val Val Val Asp Glu Asp Val Glu 180 185190 Asp Asn Tyr Ile Asp Phe Ile Ser Leu Met Glu Thr Ala Ser Asp Glu 195200 205 Phe Asp Ile Glu Trp Leu Lys Ser Asp Asp Gly Leu Ile Leu His Tyr210 215 220 Thr Ser Gly Ser Thr Gly Gln Pro Lys Gly Val Leu His Val GlnGln 225 230 235 240 Ala Met Leu Val His Tyr Ile Ser Gly Lys Tyr Val LeuAsp Leu Gln 245 250 255 Glu Asp Asp Val Tyr Trp Cys Thr Ala Asp Pro GlyTrp Val Thr Gly 260 265 270 Thr Ser Tyr Gly Ile Phe Ala Pro Trp Leu AsnGly Ala Thr Asn Cys 275 280 285 Ile Ala Gly Gly Arg Phe Ser Pro Glu GlnTrp Tyr Ser Met Ile Glu 290 295 300 Asp Phe Lys Val Thr Ile Trp Tyr ThrAla Pro Thr Ala Leu Arg Met 305 310 315 320 Leu Met Ser Ala Gly Asp AspIle Val Glu Lys Tyr Asp Leu Ser Ser 325 330 335 Leu Arg Ser Ile Leu SerVal Gly Glu Pro Leu Asn Pro Glu Val Ile 340 345 350 Lys Trp Ala Lys LysVal Tyr Gly Leu Thr Val Leu Asp Thr Trp Trp 355 360 365 Met Thr Glu ThrGly Gly His Met Ile Val Asn Tyr Pro Thr Met Asp 370 375 380 Val Gln AlaTrp Leu Xaa Gly Gln Thr Ile Thr Trp Tyr Ser Ser Cys 385 390 395 400 AsnTyr Arg 3147 base pairs nucleic acid single linear other nucleic acid/desc = ”DNA (genomic) (p14b25)“ CDS 1..3144 84 ATG AAA ATT CTC AGC CTGCGC CTG AAA AAC CTG AAC TCA TTA AAA GGC 48 Met Lys Ile Leu Ser Leu ArgLeu Lys Asn Leu Asn Ser Leu Lys Gly 1 5 10 15 GAA TGG AAG ATT GAT TTCACC CGC GAG CCG TTC GCC AGC AAC GGG CTG 96 Glu Trp Lys Ile Asp Phe ThrArg Glu Pro Phe Ala Ser Asn Gly Leu 20 25 30 TTT GCT ATT ACC GGC CCA ACAGGT GCG GGG AAA ACC ACC CTG CTG GAC 144 Phe Ala Ile Thr Gly Pro Thr GlyAla Gly Lys Thr Thr Leu Leu Asp 35 40 45 GCC ATT TGT CTG GCG CTG TAT CACGAA ACT CCG CGT CTC TCT AAC GTT 192 Ala Ile Cys Leu Ala Leu Tyr His GluThr Pro Arg Leu Ser Asn Val 50 55 60 TCA CAA TCG CAA AAT GAT CTC ATG ACCCGC GAT ACC GCC GAA TGT CTG 240 Ser Gln Ser Gln Asn Asp Leu Met Thr ArgAsp Thr Ala Glu Cys Leu 65 70 75 80 GCG GAG GTG GAG TTT GAA GTG AAA GGTGAA GCG TAC CGT GCA TTC TGG 288 Ala Glu Val Glu Phe Glu Val Lys Gly GluAla Tyr Arg Ala Phe Trp 85 90 95 AGC CAG AAT CGG GCG CGT AAC CAA CCC GACGGT AAT TTG CAG GTG CCA 336 Ser Gln Asn Arg Ala Arg Asn Gln Pro Asp GlyAsn Leu Gln Val Pro 100 105 110 CGC GTA GAG CTG GCG CGC TGC GCC GAC GGCAAA ATT CTC GCC GAC AAA 384 Arg Val Glu Leu Ala Arg Cys Ala Asp Gly LysIle Leu Ala Asp Lys 115 120 125 GTG AAA GAT AAG CTG GAA CTG ACA GCG ACGTTA ACC GGG CTG GAT TAC 432 Val Lys Asp Lys Leu Glu Leu Thr Ala Thr LeuThr Gly Leu Asp Tyr 130 135 140 GGG CGC TTC ACC CGT TCG ATG CTG CTT TCGCAG GGG CAA TTT GCT GCC 480 Gly Arg Phe Thr Arg Ser Met Leu Leu Ser GlnGly Gln Phe Ala Ala 145 150 155 160 TTC CTG AAT GCC AAA CCC AAA GAA CGCGCG GAA TTG CTC GAG GAG TTA 528 Phe Leu Asn Ala Lys Pro Lys Glu Arg AlaGlu Leu Leu Glu Glu Leu 165 170 175 ACC GGC ACT GAA ATC TAC GGG CAA ATCTCG GCG ATG GTT TTT GAG CAG 576 Thr Gly Thr Glu Ile Tyr Gly Gln Ile SerAla Met Val Phe Glu Gln 180 185 190 CAC AAA TCG GCC CGC ACA GAG CTG GAGAAG CTG CAA GCG CAG GCC AGC 624 His Lys Ser Ala Arg Thr Glu Leu Glu LysLeu Gln Ala Gln Ala Ser 195 200 205 GGC GTC ACG TTG CTC ACG CCG GAA CAAGTG CAA TCG CTG ACA GCG AGT 672 Gly Val Thr Leu Leu Thr Pro Glu Gln ValGln Ser Leu Thr Ala Ser 210 215 220 TTG CAG GTA CTT ACT GAC GAA GAA AAACAG TTA ATT ACC GCG CAG CAG 720 Leu Gln Val Leu Thr Asp Glu Glu Lys GlnLeu Ile Thr Ala Gln Gln 225 230 235 240 CAA GAA CAA CAA TCG CTA AAC TGGTTA ACG CGT CAG GAC GAA TTG CAG 768 Gln Glu Gln Gln Ser Leu Asn Trp LeuThr Arg Gln Asp Glu Leu Gln 245 250 255 CAA GAA GCC AGC CGC CGT CAG CAGGCC TTG CAA CAG GCG TTA GCC GAA 816 Gln Glu Ala Ser Arg Arg Gln Gln AlaLeu Gln Gln Ala Leu Ala Glu 260 265 270 GAA GAA AAA GCG CAA CCT CAA CTGGCG GCG CTT AGT CTG GCA CAA CCG 864 Glu Glu Lys Ala Gln Pro Gln Leu AlaAla Leu Ser Leu Ala Gln Pro 275 280 285 GCA CGA AAT CTT CGT CCA CAC TGGGAA CGC ATC GCA GAA CAC AGC GCG 912 Ala Arg Asn Leu Arg Pro His Trp GluArg Ile Ala Glu His Ser Ala 290 295 300 GCG CTG GCG CAT ATT CGC CAG CAGATT GAA GAA GTA AAT ACT CGC TTA 960 Ala Leu Ala His Ile Arg Gln Gln IleGlu Glu Val Asn Thr Arg Leu 305 310 315 320 CAG AGC ACA ATG GCG CTT CGCGCG AGC ATT CGC CAC CAC GCG GCG AAG 1008 Gln Ser Thr Met Ala Leu Arg AlaSer Ile Arg His His Ala Ala Lys 325 330 335 CAG TCA GCA GAA TTA CAG CAGCAG CAA CAA AGC CTG AAT ACC TGG TTA 1056 Gln Ser Ala Glu Leu Gln Gln GlnGln Gln Ser Leu Asn Thr Trp Leu 340 345 350 CAG GAA CAC GAC CGC TTC CGTCAG TGG AAC AAC GAA CCG GCG GGT TGG 1104 Gln Glu His Asp Arg Phe Arg GlnTrp Asn Asn Glu Pro Ala Gly Trp 355 360 365 CGT GCG CAG TTC TCC CAA CAAACC AGC GAT CGC GAG CAT CTG CGG CAA 1152 Arg Ala Gln Phe Ser Gln Gln ThrSer Asp Arg Glu His Leu Arg Gln 370 375 380 TGG CAG CAA CAG TTA ACC CATGCT GAG CAA AAA CTT AAT GCG CTT GCG 1200 Trp Gln Gln Gln Leu Thr His AlaGlu Gln Lys Leu Asn Ala Leu Ala 385 390 395 400 GCG ATC ACG TTG ACG TTAACC GCC GAT GAA GTT GCT ACC GCC CTG GCG 1248 Ala Ile Thr Leu Thr Leu ThrAla Asp Glu Val Ala Thr Ala Leu Ala 405 410 415 CAA CAT GCT GAG CAA CGCCCA CTG CGT CAG CAC CTG GTC GCG CTG CAT 1296 Gln His Ala Glu Gln Arg ProLeu Arg Gln His Leu Val Ala Leu His 420 425 430 GGA CAG ATT GTT CCC CAACAA AAA CGT CTG GCG CAG TTA CAG GTC GCT 1344 Gly Gln Ile Val Pro Gln GlnLys Arg Leu Ala Gln Leu Gln Val Ala 435 440 445 ATC CAG AAT GTC ACG CAAGAA CAG ACG CAA CGT AAC GCC GCA CTT AAC 1392 Ile Gln Asn Val Thr Gln GluGln Thr Gln Arg Asn Ala Ala Leu Asn 450 455 460 GAA ATG CGC CAG CGT TATAAA GAA AAG ACG CAG CAA CTT GCC GAT GTG 1440 Glu Met Arg Gln Arg Tyr LysGlu Lys Thr Gln Gln Leu Ala Asp Val 465 470 475 480 AAA ACC ATT TGC GAGCAG GAA GCG CGC ATC AAA ACG CTG GAA GCT CAA 1488 Lys Thr Ile Cys Glu GlnGlu Ala Arg Ile Lys Thr Leu Glu Ala Gln 485 490 495 CGT GCA CAG TTA CAGGCG GGT CAG CCT TGC CCA CTT TGT GGT TCC ACC 1536 Arg Ala Gln Leu Gln AlaGly Gln Pro Cys Pro Leu Cys Gly Ser Thr 500 505 510 AGC CAC CCG GCG GTCGAG GCG TAT CAG GCG CTG GAG CCT GGC GTT AAT 1584 Ser His Pro Ala Val GluAla Tyr Gln Ala Leu Glu Pro Gly Val Asn 515 520 525 CAG TCT CGA TTA CTGGCG CTG GAA AAC GAA GTT AAA AAG CTC GGT GAA 1632 Gln Ser Arg Leu Leu AlaLeu Glu Asn Glu Val Lys Lys Leu Gly Glu 530 535 540 GAA GGT GCG ACG CTACGT GGG CAA CTG GAC GCC ATA ACA AAG CAG CTT 1680 Glu Gly Ala Thr Leu ArgGly Gln Leu Asp Ala Ile Thr Lys Gln Leu 545 550 555 560 CAG CGT GAT GAAAAC GAA GCG CAA AGC CTC CGA CAA GAT GAG CAA GCA 1728 Gln Arg Asp Glu AsnGlu Ala Gln Ser Leu Arg Gln Asp Glu Gln Ala 565 570 575 CTT ACT CAA CAATGG CAA GCC GTC ACG GCC AGC CTC AAT ATC ACC TTG 1776 Leu Thr Gln Gln TrpGln Ala Val Thr Ala Ser Leu Asn Ile Thr Leu 580 585 590 CAG CCA CTG GACGAT ATT CAA CCG TGG CTG GAT GCA CAA GAT GAG CAC 1824 Gln Pro Leu Asp AspIle Gln Pro Trp Leu Asp Ala Gln Asp Glu His 595 600 605 GAA CGC CAG CTGCGG TTA CTC AGC CAA CGG CAT GAA TTA CAA GGG CAG 1872 Glu Arg Gln Leu ArgLeu Leu Ser Gln Arg His Glu Leu Gln Gly Gln 610 615 620 ATT GCC GCG CATAAT CAG CAA ATT ATC CAG TAT CAA CAG CAA ATT GAA 1920 Ile Ala Ala His AsnGln Gln Ile Ile Gln Tyr Gln Gln Gln Ile Glu 625 630 635 640 CAA CGC CAGCAA CTA CTT TTA ACG ACA TTG ACG GGT TAT GCA CTG ACA 1968 Gln Arg Gln GlnLeu Leu Leu Thr Thr Leu Thr Gly Tyr Ala Leu Thr 645 650 655 TTG CCA CAGGAA GAT GAA GAA GAG AGC TGG TTG GCG ACA CGT CAG CAA 2016 Leu Pro Gln GluAsp Glu Glu Glu Ser Trp Leu Ala Thr Arg Gln Gln 660 665 670 GAA GCG CAGAGC TGG CAG CAA CGC CAG AAC GAA TTA ACC GCG CTG CAA 2064 Glu Ala Gln SerTrp Gln Gln Arg Gln Asn Glu Leu Thr Ala Leu Gln 675 680 685 AAC CGT ATTCAG CAG CTG ACG CCG ATT CTG GAA ACG TTG CCG CAA AGT 2112 Asn Arg Ile GlnGln Leu Thr Pro Ile Leu Glu Thr Leu Pro Gln Ser 690 695 700 GAT GAA CTCCCG CAC TGC GAA GAA ACT GTG GTA TTG GAA AAC TGG CGG 2160 Asp Glu Leu ProHis Cys Glu Glu Thr Val Val Leu Glu Asn Trp Arg 705 710 715 720 CAG GTACAT GAA CAA TGT CTC GCA TTA CAC AGC CAG CAG CAG ACG TTA 2208 Gln Val HisGlu Gln Cys Leu Ala Leu His Ser Gln Gln Gln Thr Leu 725 730 735 CAG CAACAG GAT GTT CTG GCG GCG CAA AGT CTG CAA AAA GCC CAG GCG 2256 Gln Gln GlnAsp Val Leu Ala Ala Gln Ser Leu Gln Lys Ala Gln Ala 740 745 750 CAG TTTGAC ACC GCG CTA CAG GCC AGC GTC TTT GAC GAT CAG CAG GCG 2304 Gln Phe AspThr Ala Leu Gln Ala Ser Val Phe Asp Asp Gln Gln Ala 755 760 765 TTC CTTGCG GCG CTA ATG GAT GAA CAA ACA CTA ACG CAG CTG GAA CAG 2352 Phe Leu AlaAla Leu Met Asp Glu Gln Thr Leu Thr Gln Leu Glu Gln 770 775 780 CTC AAGCAG AAT CTG GAA AAC CAG CGC CGT CAG GCG CAA ACT CTG GTC 2400 Leu Lys GlnAsn Leu Glu Asn Gln Arg Arg Gln Ala Gln Thr Leu Val 785 790 795 800 ACTCAG ACA GCA GAA ACG CTG GCA CAG CAT CAA CAA CAC CGA CCT GAC 2448 Thr GlnThr Ala Glu Thr Leu Ala Gln His Gln Gln His Arg Pro Asp 805 810 815 GACGGG TTG GCT CTC ACT GTG ACG GTG GAG CAG ATT CAG CAA GAG TTA 2496 Asp GlyLeu Ala Leu Thr Val Thr Val Glu Gln Ile Gln Gln Glu Leu 820 825 830 GCGCAA ACT CAC CAA AAG TTG CGT GAA AAC ACC ACG AGT CAA GGC GAG 2544 Ala GlnThr His Gln Lys Leu Arg Glu Asn Thr Thr Ser Gln Gly Glu 835 840 845 ATTCGC CAG CAG CTG AAG CAG GAT GCA GAT AAC CGT CAG CAA CAA CAA 2592 Ile ArgGln Gln Leu Lys Gln Asp Ala Asp Asn Arg Gln Gln Gln Gln 850 855 860 ACCTTA ATG CAG CAA ATT GCT CAA ATG ACG CAG CAG GTT GAG GAC TGG 2640 Thr LeuMet Gln Gln Ile Ala Gln Met Thr Gln Gln Val Glu Asp Trp 865 870 875 880GGA TAT CTG AAT TCG CTA ATA GGT TCC AAA GAG GGC GAT AAA TTC CGC 2688 GlyTyr Leu Asn Ser Leu Ile Gly Ser Lys Glu Gly Asp Lys Phe Arg 885 890 895AAG TTT GCC CAG GGG CTG ACG CTG GAT AAT TTA GTC CAT CTC GCT AAT 2736 LysPhe Ala Gln Gly Leu Thr Leu Asp Asn Leu Val His Leu Ala Asn 900 905 910CAG CAA CTT ACC CGG CTG CAC GGG CGC TAT CTG TTA CAG CGC AAA GCC 2784 GlnGln Leu Thr Arg Leu His Gly Arg Tyr Leu Leu Gln Arg Lys Ala 915 920 925AGC GAG GCG CTG GAA GTC GAG GTT GTT GAT ACC TGG CAG GCA GAT GCG 2832 SerGlu Ala Leu Glu Val Glu Val Val Asp Thr Trp Gln Ala Asp Ala 930 935 940GTA CGC GAT ACC CGT ACC CTT TCC GGC GGC GAA AGT TTC CTC GTT AGT 2880 ValArg Asp Thr Arg Thr Leu Ser Gly Gly Glu Ser Phe Leu Val Ser 945 950 955960 CTG GCG CTG GCG CTG GCG CTT TCG GAT CTG GTC AGC CAT AAA ACA CGT 2928Leu Ala Leu Ala Leu Ala Leu Ser Asp Leu Val Ser His Lys Thr Arg 965 970975 ATT GAC TCG CTG TTC CTT GAT GAA GGT TTT GGC ACG CTG GAT AGC GAA 2976Ile Asp Ser Leu Phe Leu Asp Glu Gly Phe Gly Thr Leu Asp Ser Glu 980 985990 ACG CTG GAT ACC GCC CTT GAT GCG CTG GAT GCC CTG AAC GCC AGT GGC 3024Thr Leu Asp Thr Ala Leu Asp Ala Leu Asp Ala Leu Asn Ala Ser Gly 995 10001005 AAA ACC ATC GGT GTG ATT AGC CAC GTA GAA GCG ATG AAA GAG CGT ATT3072 Lys Thr Ile Gly Val Ile Ser His Val Glu Ala Met Lys Glu Arg Ile1010 1015 1020 CCG GTG CAG ATC AAA GTG AAA AAG ATC AAC GGC CTG GGC TACAGC AAA 3120 Pro Val Gln Ile Lys Val Lys Lys Ile Asn Gly Leu Gly Tyr SerLys 1025 1030 1035 1040 CTG GAA AGT ACG TTT GCA GTG AAA TAA 3147 Leu GluSer Thr Phe Ala Val Lys 1045 1048 amino acids amino acid linear protein85 Met Lys Ile Leu Ser Leu Arg Leu Lys Asn Leu Asn Ser Leu Lys Gly 1 510 15 Glu Trp Lys Ile Asp Phe Thr Arg Glu Pro Phe Ala Ser Asn Gly Leu 2025 30 Phe Ala Ile Thr Gly Pro Thr Gly Ala Gly Lys Thr Thr Leu Leu Asp 3540 45 Ala Ile Cys Leu Ala Leu Tyr His Glu Thr Pro Arg Leu Ser Asn Val 5055 60 Ser Gln Ser Gln Asn Asp Leu Met Thr Arg Asp Thr Ala Glu Cys Leu 6570 75 80 Ala Glu Val Glu Phe Glu Val Lys Gly Glu Ala Tyr Arg Ala Phe Trp85 90 95 Ser Gln Asn Arg Ala Arg Asn Gln Pro Asp Gly Asn Leu Gln Val Pro100 105 110 Arg Val Glu Leu Ala Arg Cys Ala Asp Gly Lys Ile Leu Ala AspLys 115 120 125 Val Lys Asp Lys Leu Glu Leu Thr Ala Thr Leu Thr Gly LeuAsp Tyr 130 135 140 Gly Arg Phe Thr Arg Ser Met Leu Leu Ser Gln Gly GlnPhe Ala Ala 145 150 155 160 Phe Leu Asn Ala Lys Pro Lys Glu Arg Ala GluLeu Leu Glu Glu Leu 165 170 175 Thr Gly Thr Glu Ile Tyr Gly Gln Ile SerAla Met Val Phe Glu Gln 180 185 190 His Lys Ser Ala Arg Thr Glu Leu GluLys Leu Gln Ala Gln Ala Ser 195 200 205 Gly Val Thr Leu Leu Thr Pro GluGln Val Gln Ser Leu Thr Ala Ser 210 215 220 Leu Gln Val Leu Thr Asp GluGlu Lys Gln Leu Ile Thr Ala Gln Gln 225 230 235 240 Gln Glu Gln Gln SerLeu Asn Trp Leu Thr Arg Gln Asp Glu Leu Gln 245 250 255 Gln Glu Ala SerArg Arg Gln Gln Ala Leu Gln Gln Ala Leu Ala Glu 260 265 270 Glu Glu LysAla Gln Pro Gln Leu Ala Ala Leu Ser Leu Ala Gln Pro 275 280 285 Ala ArgAsn Leu Arg Pro His Trp Glu Arg Ile Ala Glu His Ser Ala 290 295 300 AlaLeu Ala His Ile Arg Gln Gln Ile Glu Glu Val Asn Thr Arg Leu 305 310 315320 Gln Ser Thr Met Ala Leu Arg Ala Ser Ile Arg His His Ala Ala Lys 325330 335 Gln Ser Ala Glu Leu Gln Gln Gln Gln Gln Ser Leu Asn Thr Trp Leu340 345 350 Gln Glu His Asp Arg Phe Arg Gln Trp Asn Asn Glu Pro Ala GlyTrp 355 360 365 Arg Ala Gln Phe Ser Gln Gln Thr Ser Asp Arg Glu His LeuArg Gln 370 375 380 Trp Gln Gln Gln Leu Thr His Ala Glu Gln Lys Leu AsnAla Leu Ala 385 390 395 400 Ala Ile Thr Leu Thr Leu Thr Ala Asp Glu ValAla Thr Ala Leu Ala 405 410 415 Gln His Ala Glu Gln Arg Pro Leu Arg GlnHis Leu Val Ala Leu His 420 425 430 Gly Gln Ile Val Pro Gln Gln Lys ArgLeu Ala Gln Leu Gln Val Ala 435 440 445 Ile Gln Asn Val Thr Gln Glu GlnThr Gln Arg Asn Ala Ala Leu Asn 450 455 460 Glu Met Arg Gln Arg Tyr LysGlu Lys Thr Gln Gln Leu Ala Asp Val 465 470 475 480 Lys Thr Ile Cys GluGln Glu Ala Arg Ile Lys Thr Leu Glu Ala Gln 485 490 495 Arg Ala Gln LeuGln Ala Gly Gln Pro Cys Pro Leu Cys Gly Ser Thr 500 505 510 Ser His ProAla Val Glu Ala Tyr Gln Ala Leu Glu Pro Gly Val Asn 515 520 525 Gln SerArg Leu Leu Ala Leu Glu Asn Glu Val Lys Lys Leu Gly Glu 530 535 540 GluGly Ala Thr Leu Arg Gly Gln Leu Asp Ala Ile Thr Lys Gln Leu 545 550 555560 Gln Arg Asp Glu Asn Glu Ala Gln Ser Leu Arg Gln Asp Glu Gln Ala 565570 575 Leu Thr Gln Gln Trp Gln Ala Val Thr Ala Ser Leu Asn Ile Thr Leu580 585 590 Gln Pro Leu Asp Asp Ile Gln Pro Trp Leu Asp Ala Gln Asp GluHis 595 600 605 Glu Arg Gln Leu Arg Leu Leu Ser Gln Arg His Glu Leu GlnGly Gln 610 615 620 Ile Ala Ala His Asn Gln Gln Ile Ile Gln Tyr Gln GlnGln Ile Glu 625 630 635 640 Gln Arg Gln Gln Leu Leu Leu Thr Thr Leu ThrGly Tyr Ala Leu Thr 645 650 655 Leu Pro Gln Glu Asp Glu Glu Glu Ser TrpLeu Ala Thr Arg Gln Gln 660 665 670 Glu Ala Gln Ser Trp Gln Gln Arg GlnAsn Glu Leu Thr Ala Leu Gln 675 680 685 Asn Arg Ile Gln Gln Leu Thr ProIle Leu Glu Thr Leu Pro Gln Ser 690 695 700 Asp Glu Leu Pro His Cys GluGlu Thr Val Val Leu Glu Asn Trp Arg 705 710 715 720 Gln Val His Glu GlnCys Leu Ala Leu His Ser Gln Gln Gln Thr Leu 725 730 735 Gln Gln Gln AspVal Leu Ala Ala Gln Ser Leu Gln Lys Ala Gln Ala 740 745 750 Gln Phe AspThr Ala Leu Gln Ala Ser Val Phe Asp Asp Gln Gln Ala 755 760 765 Phe LeuAla Ala Leu Met Asp Glu Gln Thr Leu Thr Gln Leu Glu Gln 770 775 780 LeuLys Gln Asn Leu Glu Asn Gln Arg Arg Gln Ala Gln Thr Leu Val 785 790 795800 Thr Gln Thr Ala Glu Thr Leu Ala Gln His Gln Gln His Arg Pro Asp 805810 815 Asp Gly Leu Ala Leu Thr Val Thr Val Glu Gln Ile Gln Gln Glu Leu820 825 830 Ala Gln Thr His Gln Lys Leu Arg Glu Asn Thr Thr Ser Gln GlyGlu 835 840 845 Ile Arg Gln Gln Leu Lys Gln Asp Ala Asp Asn Arg Gln GlnGln Gln 850 855 860 Thr Leu Met Gln Gln Ile Ala Gln Met Thr Gln Gln ValGlu Asp Trp 865 870 875 880 Gly Tyr Leu Asn Ser Leu Ile Gly Ser Lys GluGly Asp Lys Phe Arg 885 890 895 Lys Phe Ala Gln Gly Leu Thr Leu Asp AsnLeu Val His Leu Ala Asn 900 905 910 Gln Gln Leu Thr Arg Leu His Gly ArgTyr Leu Leu Gln Arg Lys Ala 915 920 925 Ser Glu Ala Leu Glu Val Glu ValVal Asp Thr Trp Gln Ala Asp Ala 930 935 940 Val Arg Asp Thr Arg Thr LeuSer Gly Gly Glu Ser Phe Leu Val Ser 945 950 955 960 Leu Ala Leu Ala LeuAla Leu Ser Asp Leu Val Ser His Lys Thr Arg 965 970 975 Ile Asp Ser LeuPhe Leu Asp Glu Gly Phe Gly Thr Leu Asp Ser Glu 980 985 990 Thr Leu AspThr Ala Leu Asp Ala Leu Asp Ala Leu Asn Ala Ser Gly 995 1000 1005 LysThr Ile Gly Val Ile Ser His Val Glu Ala Met Lys Glu Arg Ile 1010 10151020 Pro Val Gln Ile Lys Val Lys Lys Ile Asn Gly Leu Gly Tyr Ser Lys1025 1030 1035 1040 Leu Glu Ser Thr Phe Ala Val Lys 1045 1239 base pairsnucleic acid single linear other nucleic acid /desc = ”DNA (genomic)(p14b74)“ CDS 1..1236 86 ATG ATG AAA GAA AAA GTG ATT TTT CTC GTT GAC ATGCAA TCG TTT TAT 48 Met Met Lys Glu Lys Val Ile Phe Leu Val Asp Met GlnSer Phe Tyr 1 5 10 15 GCA TCT GTA GAG AAA GCG GAA AAT CCA CAT TTG AAAAAT AGG CCC GTC 96 Ala Ser Val Glu Lys Ala Glu Asn Pro His Leu Lys AsnArg Pro Val 20 25 30 ATT GTT TCG GGT GAC CCT GAA AAA AGG GGC GGA GTC GTATTG GCT GCC 144 Ile Val Ser Gly Asp Pro Glu Lys Arg Gly Gly Val Val LeuAla Ala 35 40 45 TGC CCG CTG GCG AAA CAA AAG GGT GTG GTG AAT GCT TCA CGGCTG TGG 192 Cys Pro Leu Ala Lys Gln Lys Gly Val Val Asn Ala Ser Arg LeuTrp 50 55 60 GAG GCG CAG GAA AAG TGT CCT GAG GCT GTT GTG CTC CGG CCG CGTATG 240 Glu Ala Gln Glu Lys Cys Pro Glu Ala Val Val Leu Arg Pro Arg Met65 70 75 80 CAG CGG TAT ATT GAT GTA TCA CTG CAA ATT ACG GCC ATT CTC GAGGAG 288 Gln Arg Tyr Ile Asp Val Ser Leu Gln Ile Thr Ala Ile Leu Glu Glu85 90 95 TAT ACA GAC CTT GTG GAG CCG TAT TCC ATC GAT GAA CAG TTC ATG GAC336 Tyr Thr Asp Leu Val Glu Pro Tyr Ser Ile Asp Glu Gln Phe Met Asp 100105 110 ATT ACA GGC AGC CAG AAG CTG TTT GGG ACG CCG ATG GAG ATC GCG AAA384 Ile Thr Gly Ser Gln Lys Leu Phe Gly Thr Pro Met Glu Ile Ala Lys 115120 125 AGC ATT CAG GGC AGA ATC ATG CGG GAG ATC GGC GTT TAT GCA CGG GTC432 Ser Ile Gln Gly Arg Ile Met Arg Glu Ile Gly Val Tyr Ala Arg Val 130135 140 GGA ATC GGC CCT AAC AAA GCG CTG GCC AAA ATT GCG TGT GAC AAT TTT480 Gly Ile Gly Pro Asn Lys Ala Leu Ala Lys Ile Ala Cys Asp Asn Phe 145150 155 160 GCC AAA AAG AAT AAG AAC GGT ATT TTT ACC TTA ACG AAA GAA AATATG 528 Ala Lys Lys Asn Lys Asn Gly Ile Phe Thr Leu Thr Lys Glu Asn Met165 170 175 AAA ACC GAA ATG TGG CCG CTC CCG GTG GGC AGC ATG TTT GGC GTCGGG 576 Lys Thr Glu Met Trp Pro Leu Pro Val Gly Ser Met Phe Gly Val Gly180 185 190 AGC CGC ATG AAG CAT CAT TTA AAT CGA ATG GGC ATC AGC ACG ATCGGC 624 Ser Arg Met Lys His His Leu Asn Arg Met Gly Ile Ser Thr Ile Gly195 200 205 GGG CTC GCG GCT TTT CCG CTC GAT CTT TTA AAA AAG AAA TGG GGCATT 672 Gly Leu Ala Ala Phe Pro Leu Asp Leu Leu Lys Lys Lys Trp Gly Ile210 215 220 AAC GGC CAC GTG CTG TGG ATG ACG GCA AAC GGA ATC GAC TAT TCCCCT 720 Asn Gly His Val Leu Trp Met Thr Ala Asn Gly Ile Asp Tyr Ser Pro225 230 235 240 GTG TCA ACT TCG TCT CTG GAC GGG CAA AAG GCG ATA GGT CATGGA ATG 768 Val Ser Thr Ser Ser Leu Asp Gly Gln Lys Ala Ile Gly His GlyMet 245 250 255 ACT CTC CCG AGA GAC TAC GAA CAC TTT GAC AAA GAA ATC AAGGTC GTA 816 Thr Leu Pro Arg Asp Tyr Glu His Phe Asp Lys Glu Ile Lys ValVal 260 265 270 TTG CTT GAG CTG AGT GAA GAG GTG TGC AGG CGA AGC CGA AACGCC GGG 864 Leu Leu Glu Leu Ser Glu Glu Val Cys Arg Arg Ser Arg Asn AlaGly 275 280 285 GTC ATG GGG CAG ACA GTG TCA GTG AGC TGC CGG GGT GCT GATTTT GAT 912 Val Met Gly Gln Thr Val Ser Val Ser Cys Arg Gly Ala Asp PheAsp 290 295 300 TGG CCG ACG GGC TTC AAC CGG CAA GTG AAG CTG GCA GAG CCGACT AAT 960 Trp Pro Thr Gly Phe Asn Arg Gln Val Lys Leu Ala Glu Pro ThrAsn 305 310 315 320 TCT ACG CAG GAT GTA TAT GAG GCT GTA CGA CGG CTG TTTCTT ACA TTT 1008 Ser Thr Gln Asp Val Tyr Glu Ala Val Arg Arg Leu Phe LeuThr Phe 325 330 335 TGG GAC GGG AAA CCC GTC CGC CGC CTC GGT GTC AAT CTGTCT CAG CTC 1056 Trp Asp Gly Lys Pro Val Arg Arg Leu Gly Val Asn Leu SerGln Leu 340 345 350 TCA TCT GAT GAC ATA TGG CAG CTC AAT TTA TTT CAG GATTAT GCA AAG 1104 Ser Ser Asp Asp Ile Trp Gln Leu Asn Leu Phe Gln Asp TyrAla Lys 355 360 365 AAA ATG AGC CTA GGC TAT GTG ATG GAT GGC ATT AAA AATCGA TTC GGC 1152 Lys Met Ser Leu Gly Tyr Val Met Asp Gly Ile Lys Asn ArgPhe Gly 370 375 380 GAT ACA GCA ATC ATC AGG GCG GCG TCA CTG ACA GCG GCAGGC CAG GCA 1200 Asp Thr Ala Ile Ile Arg Ala Ala Ser Leu Thr Ala Ala GlyGln Ala 385 390 395 400 TTT GAA CGT GCG GCT AAA ATA GGG GGG CAT TAT AAATGA 1239 Phe Glu Arg Ala Ala Lys Ile Gly Gly His Tyr Lys 405 410 412amino acids amino acid linear protein 87 Met Met Lys Glu Lys Val Ile PheLeu Val Asp Met Gln Ser Phe Tyr 1 5 10 15 Ala Ser Val Glu Lys Ala GluAsn Pro His Leu Lys Asn Arg Pro Val 20 25 30 Ile Val Ser Gly Asp Pro GluLys Arg Gly Gly Val Val Leu Ala Ala 35 40 45 Cys Pro Leu Ala Lys Gln LysGly Val Val Asn Ala Ser Arg Leu Trp 50 55 60 Glu Ala Gln Glu Lys Cys ProGlu Ala Val Val Leu Arg Pro Arg Met 65 70 75 80 Gln Arg Tyr Ile Asp ValSer Leu Gln Ile Thr Ala Ile Leu Glu Glu 85 90 95 Tyr Thr Asp Leu Val GluPro Tyr Ser Ile Asp Glu Gln Phe Met Asp 100 105 110 Ile Thr Gly Ser GlnLys Leu Phe Gly Thr Pro Met Glu Ile Ala Lys 115 120 125 Ser Ile Gln GlyArg Ile Met Arg Glu Ile Gly Val Tyr Ala Arg Val 130 135 140 Gly Ile GlyPro Asn Lys Ala Leu Ala Lys Ile Ala Cys Asp Asn Phe 145 150 155 160 AlaLys Lys Asn Lys Asn Gly Ile Phe Thr Leu Thr Lys Glu Asn Met 165 170 175Lys Thr Glu Met Trp Pro Leu Pro Val Gly Ser Met Phe Gly Val Gly 180 185190 Ser Arg Met Lys His His Leu Asn Arg Met Gly Ile Ser Thr Ile Gly 195200 205 Gly Leu Ala Ala Phe Pro Leu Asp Leu Leu Lys Lys Lys Trp Gly Ile210 215 220 Asn Gly His Val Leu Trp Met Thr Ala Asn Gly Ile Asp Tyr SerPro 225 230 235 240 Val Ser Thr Ser Ser Leu Asp Gly Gln Lys Ala Ile GlyHis Gly Met 245 250 255 Thr Leu Pro Arg Asp Tyr Glu His Phe Asp Lys GluIle Lys Val Val 260 265 270 Leu Leu Glu Leu Ser Glu Glu Val Cys Arg ArgSer Arg Asn Ala Gly 275 280 285 Val Met Gly Gln Thr Val Ser Val Ser CysArg Gly Ala Asp Phe Asp 290 295 300 Trp Pro Thr Gly Phe Asn Arg Gln ValLys Leu Ala Glu Pro Thr Asn 305 310 315 320 Ser Thr Gln Asp Val Tyr GluAla Val Arg Arg Leu Phe Leu Thr Phe 325 330 335 Trp Asp Gly Lys Pro ValArg Arg Leu Gly Val Asn Leu Ser Gln Leu 340 345 350 Ser Ser Asp Asp IleTrp Gln Leu Asn Leu Phe Gln Asp Tyr Ala Lys 355 360 365 Lys Met Ser LeuGly Tyr Val Met Asp Gly Ile Lys Asn Arg Phe Gly 370 375 380 Asp Thr AlaIle Ile Arg Ala Ala Ser Leu Thr Ala Ala Gly Gln Ala 385 390 395 400 PheGlu Arg Ala Ala Lys Ile Gly Gly His Tyr Lys 405 410 441 base pairsnucleic acid single linear other nucleic acid /desc = ”DNA (genomic)(p14c13)“ 88 TAAGGGGACT TATATAACTG TATGTTTGTG TAGTGTTTAT GTCAGTAAGCTAAATTTACA 60 TTCATGTTAT GTYTCATTAA ACCAATTACT CACGTTTTGG TGCATATTAAATCTTTTATA 120 TCGATCATAC ATCTATCATC ATTTTTATTT CTAAAATACA AACTGAATACTTTGCTAGAA 180 TTTGTTACAG CAATCATTGC TCATGCATTT TATAAATTAC AATTAGACAAATATGACATT 240 TGATATCACA CACTTCAAAC ACACACATAT ATAATCAGAC ATAAATTGTTATGCTAAGGG 300 TTTATTCACC AAAAATATAA TACATATGGG CTTGTTTTGA GTCCATATTGAATGAATTAA 360 AAAGTATACT CCACTCAATC ATTTACAAAT AGGTGGTGCC ACTCCNAATTTATTTTATGG 420 CCAACCCCAA ATATGAGAAC A 441 909 base pairs nucleic acidsingle linear other nucleic acid /desc = ”DNA (genomic) (p15b9)“ CDS1..906 89 ATG AAC CAA AAA GGC AGA GGG CTT GAG ATC CTC ATC AAT GAA AAACAG 48 Met Asn Gln Lys Gly Arg Gly Leu Glu Ile Leu Ile Asn Glu Lys Gln 15 10 15 GAC GGC CAA TGG CTG TTT TCC GTA CTC AAA ACA GCG CTC AAA GCT TCT96 Asp Gly Gln Trp Leu Phe Ser Val Leu Lys Thr Ala Leu Lys Ala Ser 20 2530 AAA CCA GTG ATA CAA GAC TGG ATG TCC CAT CAA CAG ATA AAG GTC AAT 144Lys Pro Val Ile Gln Asp Trp Met Ser His Gln Gln Ile Lys Val Asn 35 40 45CAC GAA TCC GTC TTA AAC AAT ATG ATT GTA AAA AAG GGA GAC CGC GTG 192 HisGlu Ser Val Leu Asn Asn Met Ile Val Lys Lys Gly Asp Arg Val 50 55 60 TTCATT GAT CTT CAG GAA AGT GAA GCA TCT TCG GTC ATT CCG GAG TAT 240 Phe IleAsp Leu Gln Glu Ser Glu Ala Ser Ser Val Ile Pro Glu Tyr 65 70 75 80 GGCGAG CTT GAT ATT TTA TTT GAG GAC AAT CAT ATG CTC ATC ATC AAT 288 Gly GluLeu Asp Ile Leu Phe Glu Asp Asn His Met Leu Ile Ile Asn 85 90 95 AAA CCCGCT GGC ATC GCG ACG CAT CCG AAT GAG GAT GGG CAA ACC GGC 336 Lys Pro AlaGly Ile Ala Thr His Pro Asn Glu Asp Gly Gln Thr Gly 100 105 110 ACA CTGGCT AAT TTG ATC GCG TAT CAT TAT CAG ATA AAT GGC GAA ACA 384 Thr Leu AlaAsn Leu Ile Ala Tyr His Tyr Gln Ile Asn Gly Glu Thr 115 120 125 TGT AAGGTG CGC CAC GTC CAT CGT CTT GAT CAG GAT ACA TCT GGC GCT 432 Cys Lys ValArg His Val His Arg Leu Asp Gln Asp Thr Ser Gly Ala 130 135 140 ATC GTTTTT GCC AAG CAT CGT TTG GCA CAC GCC ATC TTA GAC CAG CAG 480 Ile Val PheAla Lys His Arg Leu Ala His Ala Ile Leu Asp Gln Gln 145 150 155 160 TTAGAG AAA AAG ACG CTG AAG CGT ACG TAT ACC GCT ATC GCT GAA GGT 528 Leu GluLys Lys Thr Leu Lys Arg Thr Tyr Thr Ala Ile Ala Glu Gly 165 170 175 AAGCTA CGG ACG AAA AAA GGG ACA ATT AAT CCA CCG ATC GGC AGA GAC 576 Lys LeuArg Thr Lys Lys Gly Thr Ile Asn Pro Pro Ile Gly Arg Asp 180 185 190 CGCTCA CAC CCG ACA AGA CGC CGG GTT TCA CCA GGC GGG CAA ACA GCC 624 Arg SerHis Pro Thr Arg Arg Arg Val Ser Pro Gly Gly Gln Thr Ala 195 200 205 GTCACT CAT TTC AAG GTA ATG GCC AGC AAT GCG AAA GAA CGG CTG TCG 672 Val ThrHis Phe Lys Val Met Ala Ser Asn Ala Lys Glu Arg Leu Ser 210 215 220 CTCGTT GAA TTA GAG CTG GAA ACA GGC AGA ACA CAC CAA ATC CGT GTT 720 Leu ValGlu Leu Glu Leu Glu Thr Gly Arg Thr His Gln Ile Arg Val 225 230 235 240CAT CTG GCG AGC CTC GGC CAT CCG TTG ACA GGA GAC TCG CTT TAC GGA 768 HisLeu Ala Ser Leu Gly His Pro Leu Thr Gly Asp Ser Leu Tyr Gly 245 250 255GGC GGG AGC AAG CTG CTA AAC AGG CAG GCA CTG CAC GCC AAT AAA GTA 816 GlyGly Ser Lys Leu Leu Asn Arg Gln Ala Leu His Ala Asn Lys Val 260 265 270CAA GCG GTT CAC CCG ATA ACA GAC GAG CTC ATA GTT GCT GAA GCC CCT 864 GlnAla Val His Pro Ile Thr Asp Glu Leu Ile Val Ala Glu Ala Pro 275 280 285TTC CCT GCT GAT ATG AAA AAC CTT TGC CGC ACA TAT TTT TCA 906 Phe Pro AlaAsp Met Lys Asn Leu Cys Arg Thr Tyr Phe Ser 290 295 300 TGA 909 302amino acids amino acid linear protein 90 Met Asn Gln Lys Gly Arg Gly LeuGlu Ile Leu Ile Asn Glu Lys Gln 1 5 10 15 Asp Gly Gln Trp Leu Phe SerVal Leu Lys Thr Ala Leu Lys Ala Ser 20 25 30 Lys Pro Val Ile Gln Asp TrpMet Ser His Gln Gln Ile Lys Val Asn 35 40 45 His Glu Ser Val Leu Asn AsnMet Ile Val Lys Lys Gly Asp Arg Val 50 55 60 Phe Ile Asp Leu Gln Glu SerGlu Ala Ser Ser Val Ile Pro Glu Tyr 65 70 75 80 Gly Glu Leu Asp Ile LeuPhe Glu Asp Asn His Met Leu Ile Ile Asn 85 90 95 Lys Pro Ala Gly Ile AlaThr His Pro Asn Glu Asp Gly Gln Thr Gly 100 105 110 Thr Leu Ala Asn LeuIle Ala Tyr His Tyr Gln Ile Asn Gly Glu Thr 115 120 125 Cys Lys Val ArgHis Val His Arg Leu Asp Gln Asp Thr Ser Gly Ala 130 135 140 Ile Val PheAla Lys His Arg Leu Ala His Ala Ile Leu Asp Gln Gln 145 150 155 160 LeuGlu Lys Lys Thr Leu Lys Arg Thr Tyr Thr Ala Ile Ala Glu Gly 165 170 175Lys Leu Arg Thr Lys Lys Gly Thr Ile Asn Pro Pro Ile Gly Arg Asp 180 185190 Arg Ser His Pro Thr Arg Arg Arg Val Ser Pro Gly Gly Gln Thr Ala 195200 205 Val Thr His Phe Lys Val Met Ala Ser Asn Ala Lys Glu Arg Leu Ser210 215 220 Leu Val Glu Leu Glu Leu Glu Thr Gly Arg Thr His Gln Ile ArgVal 225 230 235 240 His Leu Ala Ser Leu Gly His Pro Leu Thr Gly Asp SerLeu Tyr Gly 245 250 255 Gly Gly Ser Lys Leu Leu Asn Arg Gln Ala Leu HisAla Asn Lys Val 260 265 270 Gln Ala Val His Pro Ile Thr Asp Glu Leu IleVal Ala Glu Ala Pro 275 280 285 Phe Pro Ala Asp Met Lys Asn Leu Cys ArgThr Tyr Phe Ser 290 295 300 471 base pairs nucleic acid single linearother nucleic acid /desc = ”DNA (genomic) (p15b32)“ CDS 1..468 91 ATGGAT GAG TTG CAG CAG GTC TCC GAT GGC TGG CTT GCG CAC CAC AAT 48 Met AspGlu Leu Gln Gln Val Ser Asp Gly Trp Leu Ala His His Asn 1 5 10 15 ACGCGG GAA AAG CGC TTC TCG CTT GGC GCT TTC GAA CCG GAC TAT ATT 96 Thr ArgGlu Lys Arg Phe Ser Leu Gly Ala Phe Glu Pro Asp Tyr Ile 20 25 30 CTG TCGCAA CCC GTC GCC GTG CTG CGC AAG GAT GGA AAA ATC ACC GCC 144 Leu Ser GlnPro Val Ala Val Leu Arg Lys Asp Gly Lys Ile Thr Ala 35 40 45 TTC GCC AATCTG ATG GTG ACG GAG ACG AAA AAG GAA GCC ACC ATC GAC 192 Phe Ala Asn LeuMet Val Thr Glu Thr Lys Lys Glu Ala Thr Ile Asp 50 55 60 CTC ATG CGC TTTTCG GCG GAT GCG CGC GCG GCT CGA TGG ATT TCC TCT 240 Leu Met Arg Phe SerAla Asp Ala Arg Ala Ala Arg Trp Ile Ser Ser 65 70 75 80 TCG TCA GCA TCATGC AGC ATC TGC GCG AGG CGG GAT ATG AAA GCT TCA 288 Ser Ser Ala Ser CysSer Ile Cys Ala Arg Arg Asp Met Lys Ala Ser 85 90 95 ATC TCG GCA TGG CGCCCA TGT CCG GCA TGT CGA AGC GCG ATG CCG CGC 336 Ile Ser Ala Trp Arg ProCys Pro Ala Cys Arg Ser Ala Met Pro Arg 100 105 110 CGG TCT GGG ACC GTATCG GCA GCA CGC TGT TCG AGC ACG GCG AAC GTT 384 Arg Ser Gly Thr Val SerAla Ala Arg Cys Ser Ser Thr Ala Asn Val 115 120 125 TCT ACA ACT TCA AGGGAC TTC GCG CCT TCA AGG CAA AGT TCC ACC CGA 432 Ser Thr Thr Ser Arg AspPhe Ala Pro Ser Arg Gln Ser Ser Thr Arg 130 135 140 AAT GGG AAC CCC GTTACC TTG CTG TGC AGA ACG GCG TGA 471 Asn Gly Asn Pro Val Thr Leu Leu CysArg Thr Ala 145 150 155 156 amino acids amino acid linear protein 92 MetAsp Glu Leu Gln Gln Val Ser Asp Gly Trp Leu Ala His His Asn 1 5 10 15Thr Arg Glu Lys Arg Phe Ser Leu Gly Ala Phe Glu Pro Asp Tyr Ile 20 25 30Leu Ser Gln Pro Val Ala Val Leu Arg Lys Asp Gly Lys Ile Thr Ala 35 40 45Phe Ala Asn Leu Met Val Thr Glu Thr Lys Lys Glu Ala Thr Ile Asp 50 55 60Leu Met Arg Phe Ser Ala Asp Ala Arg Ala Ala Arg Trp Ile Ser Ser 65 70 7580 Ser Ser Ala Ser Cys Ser Ile Cys Ala Arg Arg Asp Met Lys Ala Ser 85 9095 Ile Ser Ala Trp Arg Pro Cys Pro Ala Cys Arg Ser Ala Met Pro Arg 100105 110 Arg Ser Gly Thr Val Ser Ala Ala Arg Cys Ser Ser Thr Ala Asn Val115 120 125 Ser Thr Thr Ser Arg Asp Phe Ala Pro Ser Arg Gln Ser Ser ThrArg 130 135 140 Asn Gly Asn Pro Val Thr Leu Leu Cys Arg Thr Ala 145 150155 192 base pairs nucleic acid single linear other nucleic acid /desc =”DNA (genomic) (p15c4)“ CDS 1..189 93 ATG CCT ATT GCC CAG ATC CAC ATCCTT GAA GGC CGC AGC GAC GAG CAG 48 Met Pro Ile Ala Gln Ile His Ile LeuGlu Gly Arg Ser Asp Glu Gln 1 5 10 15 AAG GAA ACC CTG ATT CGG GAA GTCAGT GAG GCC ATC TCG CGC TCC CTG 96 Lys Glu Thr Leu Ile Arg Glu Val SerGlu Ala Ile Ser Arg Ser Leu 20 25 30 GAT GCG CCG CTG ACC AGC GTG CGA GTGATT ATC ACG GAG ATG GCC AAG 144 Asp Ala Pro Leu Thr Ser Val Arg Val IleIle Thr Glu Met Ala Lys 35 40 45 GGC CAC TTC GGC ATC GGC GGC GAA CTG GCCAGC AAG GTC AGA CGC 189 Gly His Phe Gly Ile Gly Gly Glu Leu Ala Ser LysVal Arg Arg 50 55 60 TGA 192 63 amino acids amino acid linear protein 94Met Pro Ile Ala Gln Ile His Ile Leu Glu Gly Arg Ser Asp Glu Gln 1 5 1015 Lys Glu Thr Leu Ile Arg Glu Val Ser Glu Ala Ile Ser Arg Ser Leu 20 2530 Asp Ala Pro Leu Thr Ser Val Arg Val Ile Ile Thr Glu Met Ala Lys 35 4045 Gly His Phe Gly Ile Gly Gly Glu Leu Ala Ser Lys Val Arg Arg 50 55 6030 base pairs nucleic acid single linear other nucleic acid /desc =”primer“ 95 GGAGCTCACT AGTCGGAGGC ATCAGTGACC 30 27 base pairs nucleicacid single linear other nucleic acid /desc = ”primer“ 96 GGGATCCCATGAGAATTCTT GAAGACG 27 89 base pairs nucleic acid single linear othernucleic acid /desc = ”oligonucleotide“ 97 CTAGGTACCT ACAACCTCAAGCTTNKNKNK NKNKNKNKNK NKNKNKNKNK NKNKNKNKNK 60 NKNKAAGCTT GGTTAGAATGGGTACCATG 89 24 base pairs nucleic acid single linear other nucleic acid/desc = ”primer“ 98 CTAGAATTCT ACAACCTCAA GCTT 24 25 base pairs nucleicacid single linear other nucleic acid /desc = ”primer“ 99 AAGCTTGGTTAGAATGGAAT TCATG 25 15 base pairs nucleic acid single linear othernucleic acid /desc = ”primer“ 100 GAATTCCATT CTAAC 15 18 base pairsnucleic acid single linear other nucleic acid /desc = ”primer“ 101ATTCCATTCT AACCAAGC 18 21 base pairs nucleic acid single linear othernucleic acid /desc = ”primer“ 102 TGAACTGCCA CTGTAGAGAG A 21 40 basepairs nucleic acid single linear other nucleic acid /desc = ”primer“ 103GGCCATCGAT AATGAAATTA ATTAACGAGA GACAAATAGG 40 55 base pairs nucleicacid single linear other nucleic acid /desc = ”primer“ 104 GGCCGGATCCCTAGTGATGG TGATGGTGAT GAAAAATTCT GTCTTTAACT TTTTT 55 36 base pairsnucleic acid single linear other nucleic acid /desc = ”primer“ 105GGCCGGTACC AAATTAATTA ACGAGAGACA AATAGG 36 55 base pairs nucleic acidsingle linear other nucleic acid /desc = ”primer“ 106 GGCCGGATCCCTAGTGATGG TGATGGTGAT GAAAAATTCT GTCTTTAACT TTTTT 55 33 base pairsnucleic acid single linear other nucleic acid /desc = ”primer“ 107TGGAGATCTA AGCTTTGCAT AACTTTCTCG TCC 33 18 base pairs nucleic acidsingle linear other nucleic acid /desc = ”primer“ 108 TCCTGGCGATTCTGAGAC 18 27 base pairs nucleic acid single linear other nucleic acid/desc = ”primer“ 109 TGGGGATCCG ATAAGTGTGA CTGGTAG 27 29 base pairsnucleic acid single linear other nucleic acid /desc = ”primer“ 110TGGAAGCTTA CATTACTTCA AATAAATTA 29 31 base pairs nucleic acid singlelinear other nucleic acid /desc = ”primer“ 111 TGGGGATCCT GCATATCAAAATGTTTATGG C 31 28 base pairs nucleic acid single linear other nucleicacid /desc = ”primer“ 112 TGGAAGCTTA CACATATGCC AATCTCAC 28 28 basepairs nucleic acid single linear other nucleic acid /desc = ”primer“ 113GTTGGATCCG CTGTTGTTAC TTTGATGC 28 29 base pairs nucleic acid singlelinear other nucleic acid /desc = ”primer“ 114 TGGAAGCTTA CATTACTTCAAATAAATTA 29 28 base pairs nucleic acid single linear other nucleic acid/desc = ”primer“ 115 GTTGGATCCG CTGTTGTTAC TTTGATGC 28 28 base pairsnucleic acid single linear other nucleic acid /desc = ”primer“ 116TGGAAGCTTA CACATATGCC AATCTCAC 28 20 base pairs nucleic acid singlelinear other nucleic acid /desc = ”primer“ 117 ATTTGATATG TCTCAACTGC 2020 base pairs nucleic acid single linear other nucleic acid /desc =”primer“ 118 GCTCTAATTT TTAAGTGAGG 20 21 base pairs nucleic acid singlelinear other nucleic acid /desc = ”primer“ 119 TATCTGGTGG CGTAACACCT G21 21 base pairs nucleic acid single linear other nucleic acid /desc =”primer“ 120 GATGAAGCCG TTACGTTGTT C 21 22 base pairs nucleic acidsingle linear other nucleic acid /desc = ”primer“ 121 GCCATAAGGATGTGAATGTA TG 22 19 base pairs nucleic acid single linear other nucleicacid /desc = ”primer“ 122 GCATTTGCTA GTTATCTTG 19 24 base pairs nucleicacid single linear other nucleic acid /desc = ”primer“ 123 AGATCTATCAAGGATGTGAT GGTT 24 24 base pairs nucleic acid single linear othernucleic acid /desc = ”primer“ 124 GTCATTATAC GATTTAGTAC AATC 24 124amino acids amino acid single linear protein NO NO 125 Met Lys Asp GluGln Leu Tyr Tyr Phe Glu Lys Ser Pro Val Phe Lys 1 5 10 15 Ala Met MetHis Phe Ser Leu Pro Met Met Ile Gly Thr Leu Leu Ser 20 25 30 Val Ile TyrGly Ile Leu Asn Tyr Ile Phe Ile Gly Phe Ser Glu Glu 35 40 45 Ser His MetIle Ser Ala Ile Ser Leu Thr Leu Pro Val Phe Ala Ile 50 55 60 Leu Met GlyLeu Gly Asn Leu Phe Gly Val Gly Ala Gly Thr Tyr Ile 65 70 75 80 Ser ArgLeu Leu Gly Ala Lys Asp Tyr Ser Lys Ser Lys Phe Val Ser 85 90 95 Ser PheSer Ile Tyr Gly Gly Ile Ala Leu Gly Leu Ile Val Ile Leu 100 105 110 ValThr Leu Pro Phe Ser Asp Gln Ser Gln Gln Phe 115 120 1500 base pairsnucleic acid single linear cDNA to mRNA NO NO 126 GTGTGACTGG TAGAAATCAGTCACTTTGTC TTTAATATTA TAGTTAGATA TCTAATTGTT 60 AGTAAGCTAA TTATTGGAAAAGACAAGGAG TATTGAACAA TGAAAGACGA ACAATTATAT 120 TATTTTGAGA AATCGCCAGTATTTAAAGCG ATGATGCATT TCTCATTGCC AATGATGATA 180 GGGACTTTAT TAAGCGTTATTTATGGCATA TTAAATATTT ACTTTATAGG ATTTTTAGAA 240 GATAGCCACA TGATTTCTGCTATCTCTCTA ACACTGCCAG TATTTGCTAT CTTAATGGGG 300 TTAGGTAATT TATTTGGCGTTGGTGCAGGA ACTTATATTT CACGTTTATT AGGTGCGAAA 360 GACTATAGTA AGAGTAAATTTGTAAGTAGT TTCTCTATTT ATGGTGGTAT TGCACTAGGA 420 CTTATCGTGA TTTTAGTTACTTTACCATTC AGTGATCAAA TCGCAGCAAT TTTAGGGGCG 480 AGAGGTGAAA CGTTAGCTTTAACAAGTAAT TATTTGAAAG TAATGTTTTT AAGTGCACCT 540 TTTGTAATTT TGTTCTTCATATTAGAACAA TTTGCACGTG CAATTGGGGC ACCAATGGTT 600 TCTATGATTG GTATGTTAGCTAGTGTAGGC TTAAATATTA TTTTAGATCC AATTTTAATT 660 TTTGGTTTTG ATTTAAACGTTGTTGGTGCA GCTTTGGGTA CTGCAATCAG TAATGTTGCT 720 GCTGCTCTGT TCTTTATCATTTATTTTATG AAAAATAGTG ACGTTGTGTC AGTTAATATT 780 AAACTTGCGA AACCTAATAAAGAAATGCTT TCTGAAATCT TTAAAATCGG TATTCCTGCA 840 TTTTTAATGA GTATCTTAATGGGATTCACA GGATTAGTTT TAAATTTATT TTTAGCACAT 900 TATGGAAACT TCGCGATTGCAAGTTATGGT ATCTCATTTA GACTTGTGCA ATTTCCAGAA 960 CTTATTATCA TGGGATTATGTGAAGGTGTT GTACCACTAA TTGCATATAA CTTTATGGCA 1020 AATAAAGGCC GTATGAAAGACGTTATCAAA GCAGTTATCA TGTCTATCGG CGTTATCTTT 1080 GTTGTATGTA TGAGTGCTGTATTTACAATT GGACATCATA TGGTCGGACT ATTTACTACT 1140 GATCAAGCCA TTGTTGAGATGGCGACATTT ATTTTGAAAG TAACAATGGC ATCATTATTA 1200 TTAAATGGTA TAGGTTTCTTGTTTACTGGT ATGCTTCAAG CGACTGGGCA AGGTCGTGGT 1260 GCTACAATTA TGGCCATTTTACAAGGTGCA ATTATCATTC CAGTATTATT TATTATGAAT 1320 GCTTTGTTTG GACTAACAGGTGTCATTTGG TCATTATTAA TTGCTGAGTC ACTTTGTGCT 1380 TTAGCAGCAA TGTTAATCGTCTATTTATTA CGTGATCGTT TGACAGTTGA TACATCTGAA 1440 TTAATAGAAG GTTAAATATTTCGTCCACTT CTGGCTGAGT ATATTTCGGT CGGAAGTGTA 1500 451 amino acids aminoacid single linear protein NO NO 127 Met Lys Asp Glu Gln Leu Tyr Tyr PheGlu Lys Ser Pro Val Phe Lys 1 5 10 15 Ala Met Met His Phe Ser Leu ProMet Met Ile Gly Thr Leu Leu Ser 20 25 30 Val Ile Tyr Gly Ile Leu Asn IleTyr Phe Ile Gly Phe Leu Glu Asp 35 40 45 Ser His Met Ile Ser Ala Ile SerLeu Thr Leu Pro Val Phe Ala Ile 50 55 60 Leu Met Gly Leu Gly Asn Leu PheGly Val Gly Ala Gly Thr Tyr Ile 65 70 75 80 Ser Arg Leu Leu Gly Ala LysAsp Tyr Ser Lys Ser Lys Phe Val Ser 85 90 95 Ser Phe Ser Ile Tyr Gly GlyIle Ala Leu Gly Leu Ile Val Ile Leu 100 105 110 Val Thr Leu Pro Phe SerAsp Gln Ile Ala Ala Ile Leu Gly Ala Arg 115 120 125 Gly Glu Thr Leu AlaLeu Thr Ser Asn Tyr Leu Lys Val Met Phe Leu 130 135 140 Ser Ala Pro PheVal Ile Leu Phe Phe Ile Leu Glu Gln Phe Ala Arg 145 150 155 160 Ala IleGly Ala Pro Met Val Ser Met Ile Gly Met Leu Ala Ser Val 165 170 175 GlyLeu Asn Ile Ile Leu Asp Pro Ile Leu Ile Phe Gly Phe Asp Leu 180 185 190Asn Val Val Gly Ala Ala Leu Gly Thr Ala Ile Ser Asn Val Ala Ala 195 200205 Ala Leu Phe Phe Ile Ile Tyr Phe Met Lys Asn Ser Asp Val Val Ser 210215 220 Val Asn Ile Lys Leu Ala Lys Pro Asn Lys Glu Met Leu Ser Glu Ile225 230 235 240 Phe Lys Ile Gly Ile Pro Ala Phe Leu Met Ser Ile Leu MetGly Phe 245 250 255 Thr Gly Leu Val Leu Asn Leu Phe Leu Ala His Tyr GlyAsn Phe Ala 260 265 270 Ile Ala Ser Tyr Gly Ile Ser Phe Arg Leu Val GlnPhe Pro Glu Leu 275 280 285 Ile Ile Met Gly Leu Cys Glu Gly Val Val ProLeu Ile Ala Tyr Asn 290 295 300 Phe Met Ala Asn Lys Gly Arg Met Lys AspVal Ile Lys Ala Val Ile 305 310 315 320 Met Ser Ile Gly Val Ile Phe ValVal Cys Met Ser Ala Val Phe Thr 325 330 335 Ile Gly His His Met Val GlyLeu Phe Thr Thr Asp Gln Ala Ile Val 340 345 350 Glu Met Ala Thr Phe IleLeu Lys Val Thr Met Ala Ser Leu Leu Leu 355 360 365 Asn Gly Ile Gly PheLeu Phe Thr Gly Met Leu Gln Ala Thr Gly Gln 370 375 380 Gly Arg Gly AlaThr Ile Met Ala Ile Leu Gln Gly Ala Ile Ile Ile 385 390 395 400 Pro ValLeu Phe Ile Met Asn Ala Leu Phe Gly Leu Thr Gly Val Ile 405 410 415 TrpSer Leu Leu Ile Ala Glu Ser Leu Cys Ala Leu Ala Ala Met Leu 420 425 430Ile Val Tyr Leu Leu Arg Asp Arg Leu Thr Val Asp Thr Ser Glu Leu 435 440445 Ile Glu Gly 450

What is claimed is:
 1. A method of screening for an anti-bacterial agentcomprising the steps of: (a) contacting a bacterial virulence proteinencoded by a DNA sequence set forth in SEQ ID NO: 41, (corresponding tosignature tag identification numbers P10B2) with a chemical compound,wherein said chemical compound is a candidate anti-bacterial agent, and(b) identifying said chemical compound as an agent that interferes withthe function of said bacterial virulence protein when said chemicalcompound binds to or interacts with said bacterial virulence protein. 2.The method of claim 1 wherein said bacterial virulence protein isrepresented by SEQ ID NO:42, (corresponding to signature tagidentification numbers P10B2).
 3. The method of claim 1 or 2 whereinsaid bacterial virulence protein is an enzyme and in step (b) binding orinteraction of said chemical compound with said bacterial virulenceprotein inhibits enzymatic activity of said bacterial virulence protein.4. The method of claim 1 or 2 wherein said chemical compound is apolypeptide and step (b) comprises a two-hybrid screening assaydetecting expression of a reporter gene in a host cell that comprises:(a) said bacterial virulence protein fused to a DNA-binding domain thatinteracts with an upstream activation sequence (UAS), (b) said chemicalcompound fused to a transcription activation domain, and (c) said UASoperatively linked to said reporter gene, and wherein expression of saidreporter gene occurs upon interaction of said bacterial virulenceprotein with said chemical compound.
 5. The method of claim 1 or 2wherein step (b) comprises a binding assay wherein unfolding of saidbacterial virulence protein is measured in the presence and absence ofsaid chemical compound.
 6. The method of claim 1 or 2 wherein step (b)comprises an affinity ultrafiltration assay wherein ultrafiltration isused to separate chemical compounds that bind to said bacterialvirulence protein from chemical compounds that do not bind to saidbacterial virulence protein.
 7. The method of claim 1 or 2 wherein saidchemical compound is a polypeptide and step (b) comprises a two-hybridscreening assay detecting expression of a reporter gene inma host cellthat comprises: (a) said chemical compound fused to a DNA-binding domainthat interacts with an upstream activation sequence (UAS), (b) saidbacterial virulence protein fused to a transcription activation domain,and (c) said UAS operatively linked to said reporter gene, and whereinexpression of said reporter gene occurs upon interaction of saidbacterial virulence protein with said chemical compound.
 8. The methodof claim 1 or 2 further comprising the step of measuring the effect ofsaid identified agent on bacterial replication.
 9. The method of claim 1or 2 further comprising the step of measuring the effect of saididentified agent on bacterial infection in an animal.